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WO 99/58558 PCT/US99/10567 

CELL SIGNALING PROTEINS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of cell signaling proteins 
5 and to the use of these sequences in the diagnosis, treatment, and prevention of cell proliferative 

and inflammatory disorders. t ' 

' i ' 

i 

BACKGROUND OF THE INVENTION 

10 Signal transduction is the process of biochemical events by which cells respond to 

extracellular signals. Extracellular signals are transduced through a biochemical cascade that 
begins with the binding of a signal molecule such as a hormone, neurotransmitter, or growth 
factor, to a cell membrane receptor and ends with the activation of an intracellular target molecule. 
The process of signal transduction regulates a wide variety of cell functions including cell » 

15 proliferation, differentiation, and gene transcription. , 

Signal transduction is the general process by which cells respond to extracellular signals 
(hormones, neurotransmitters, growth and differentiation factors, etc.) through a cascade of 
biochemical reactions that begins with the binding of the signaling molecule to a cell membrane 
receptor and ends with the activation of an intracellular target molecule. Intermediate steps in this 

20 process involve the activation of various cytoplasmic proteins by phosphorylation via protein 

i 

kinases and the eventual translocation of some of these activated proteins to the cell nucleus where 
the transcription of specific genes is triggered. Thus, the signal transduction process regulates all 
types of cell functions including cell proliferation, differentiation, and gene transcription. 

Protein kinases play a key role in the signal transduction process by phosphorylating and 

25 activating various proteins involved in signaling pathways. The high energy phosphate which 
drives this activation is generally transferred from adenosine triphosphate molecules (ATP) to a 
particular protein by protein kinases and removed from that protein by protein phosphatases. 
Phosphorylation occurs in response to extracellular signals, cell cycle checkpoints, and 
environmental or nutritional stresses. Protein kinases are roughly divided into two groups; those 

30 that phosphorylate tyrosine residues (protein tyrosine kinases, PTK) and those that phosphorylate 
serine or threonine residues (serine/threonine kinases, STK). A few protein kinases have dual 
specificity for serine/threonine and tyrosine residues. Almost all kinases contain a similar 250-300 
amino acid catalytic domain containing specific residues and sequence motifs characteristic of the 
kinase family. (Hardie, G. and Hanks, S. (1995) The Protein Kinase Facts Books. Vol 1:7-20 
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Academic Press, San Diego, CA.) 

STKs include the second messenger dependent protein kinases such as the cyclic-AMP 
dependent protein kinases (PKA), which are involved in mediating hormone-induced cellular 
responses; calcium-calmodulin (CaM) dependent protein kinases, which are involved in regulation 
5 of smooth muscle contraction, glycogen breakdown, and neurotransmission; and the mitogen- 
activated protein kinases (MAP) which mediate signal transduction from the cell surface to the 
nucleus via phosphorylation cascades. Altered PKA expression is implicated in a variety of 
disorders and diseases including cancer, thyroid disorders, diabetes, atherosclerosis, and 
cardiovascular disease. (Isselbacher, K.J. et al. (1994) Harrison's Principles of Internal Medicine 
1 0 McGraw-Hill, New York, NY, pp. 4 1 6-43 1 , 1 887.) 

PTKs are divided into transmembrane, receptor PTKs and nontransmembrane, non- 
receptor PTKs. Transmembrane protein-tyrosine kinases are receptors for most growth factors 
which include epidermal GF, platelet-derived GF, fibroblast GF, hepatocyte GF, insulin and 
insulin-like GFs, nerve GF, vascular endothelial GF, and macrophage colony stimulating factor. 
1 5 Non-receptor PTKs lack transmembrane regions and, instead, form complexes with the 

intracellular regions of cell surface receptors. Receptors that function through non-receptor PTKs 
include those for cytokines, hormones (growth hormone and prolactin) and antigen-specific 
receptors on T and B lymphocytes. 

Many of these PTKs were first identified as the products of mutant oncogenes in cancer 
20 cells where their activation was no longer subject to normal cellular controls. In fact, about one 
third of the known oncogenes encode PTKs, and it is well known that cellular transformation 
(oncogenesis) is often accompanied by increased tyrosine phosphorylation activity. (Charbonneau 
H and Tonks NK (1992) Annu Rev Cell Brol 8:463-493.) 

Protein phosphatases regulate the effects of protein kinases by removing phosphate groups 
25 from molecules previously activated by kinases. The two principle categories of protein 

phosphatases are the protein phosphatases (PPs) and the protein tyrosine phosphatases (PTPs). 
PPs dephosphorylate phosphoserine/threonine residues and are important regulators of many 
cAMP-mediated hormone responses in cells. (Cohen, P. (1989) Annu. Rev. Biochem. 58:453- 
508.) PTPs reverse the effects of protein tyrosine kinases and play a significant role in cell cycle 
30 and cell signaling processes. (Charbonneau and Tonks, supra,) In the process of cell division, for 
example, a specific PTP (M-phase inducer phosphatase) plays a key role in the induction of 
mitosis by dephosphorylating and activating a specific PTK (CDC2) leading to cell division. 
(Sadu, K.. etal. (1990) Proc. Natl. Acad. Sci. 87:5139-5143.) 

Guanine nucleotide binding proteins (GTP-binding proteins) are critical mediators of the 
35 signal transduction pathway. Extracellular ligands such as hormones, growth factors, 
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neuromodulators, or other signaling molecules bind to transmembrane receptors, and the signal is 
propagated to effector molecules by intracellular signal transducing proteins. Many of these signal 
transduction proteins are GTP-binding proteins which regulate intracellular signaling pathways. 
GTP-binding proteins participate in a wide range of other regulatory functions including 
5 metabolism, growth, differentiation, cytoskeletal organization, and intracellular vesicle transport 
and secretion. Exchange of bound GDP for GTP followed by hydrolysis of GTP to GDP provides 
the energy that enables GTP-binding proteins to alter their conformation and interact with other 
cellular components. Two structurally distinct classes of GTP-binding proteins are recognized: 
heterotrimeric GTP-binding proteins, consisting of three different subunits, and monomelic, low 

10 molecular weight (LMW), GTP-binding proteins consisting of a single polypeptide chain. 

G protein coupled receptors (GPCR) are a superfamily of integral membrane proteins 
which transduce extracellular signals. GPCRs include receptors for biogenic amines, mediators of 
inflammation, peptide hormones, and sensory signal mediators. A GPCR becomes activated when 
the receptor binds to its extracellular ligand. The beta subunit of the GPCR, which consists of an 

15 amino- terminal helical segment followed by seven WD, or J} transducin repeats, transduces signals 
across the plasma membrane. Conformational changes in the GPCR, resulting from the iigand- 
receptor interaction, promote the binding of GTP to the GPCR intracellular domains. GTP binding 
to the GPCR leads to the interaction of the GPCR alpha subunit with adenylate cyclase or other 
second messenger molecule generators. This interaction regulates the activity of second 

20 messenger molecules such as cAIVlP, cGMP, or eicosinoids which, in turn, regulate 

phosphorylation and activation of other intracellular proteins. The GPCR changes conformation 
upon hydrolysis of the bound GTP by GTPases, dissociates from the second messenger molecule 
generator, and returns to its initial pre-ligand binding conformation. 

G beta proteins, also known as P transducins, contain seven tandem repeats of the WD- 

25 repeat sequence motif, a motif found in many proteins with regulatory functions. WD-repeat 
proteins contain from four to eight copies of a loosely conserved repeat of approximately 40 
amino acids which participates in protein-protein interactions. Mutations and variant expression 
of $ transducin proteins are linked with various disorders. Mutations in LIS1, a subunit of the 
human platelet activating factor acetylhydrolase, cause Miller-Dieker lissencephaly. RACK1 

30 binds activated protein kinase C, and RbAp48 binds retinoblastoma protein. CstF is required for 
polyadenylation of mammalian pre-mRNA in vitro and associates with subunits of cleavage- 
stimulating factor. CD4, an integral membrane glycoprotein which functions as an HIV co- 
receptor for infection of human host cells is degraded by HIV-encoded Vpu in the endoplasmic 
reticulum. WD repeats of human beta TrCP molecule mediate the formation of the CD4- Vpu, 

35 inducing CD4 proteolysis (Neer, E.J. et al. (1994) Nature 371 :297-300 and Margottin, F. et al. 
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(1998) Mol. Cell. 1 :565-574). 

Irregularities in the GPCR signaling cascade may result in abnormal activation of 
leukocytes and lymphocytes, leading to the tissue damage and destruction seen in many 
inflammatory and autoimmune diseases such as rheumatoid arthritis, biliary cirrhosis, hemolytic 
5 anemia, lupus erythematosus, and thyroiditis. Abnormal cell proliferation, including cyclic AMP 
stimulation of brain, thyroid, adrenal, and gonadal tissue proliferation is regulated by G proteins. 
Mutations in p a subunits have been found in grbwth-hormone-secreting pituitary somatotroph 
tumors, hyperfunctioning thyroid adenomas, and ovarian and adrenal neoplasms (Meij, J.TA. 
(1996) Mol. Cell. Biochem. 157:31-38; Aussel, C. et al. (1988) J. Immunol. 140:215-220). 
10 LMW GTP-binding proteins regulate cell growth, cell cycle control, protein secretion, and 

intracellular vesicle interaction. They consist of single polypeptides which, like the alpha subunit 
of the heterotrimeric GTP-binding proteins, are able to bind to and hydrolyze GTP, thus cycling 
between an inactive and an active state. LMW GTP-binding proteins respond to extracellular 
signals from receptors and activating proteins by transducing mitogenic signals involved in 
1 5 various cell functions. The binding and hydrolysis of GTP regulates the response of LMW GTP- 
binding proteins and acts as an energy source during this process (Bokoch, G. M. and Der, C. J. 
(1993) FASEB J. 7:750-759). 

At least sixty members of the LMW GTP-binding protein superfamily have been 
identified and are currently grouped into the four subfamilies of ras, rho, arf, sari, ran, and rab. 
20 Activated ras genes were initially found in human cancers and subsequent studies confirmed that 
ras function is critical in determining whether cells continue to grow or become differentiated. 
Other members of the LMW G-protein superfamily have roles in signal transduction that vary with 
the function of the activated genes and the locations of the GTP-binding proteins that initiate the 
activity. Rho GTP-binding proteins control signal transduction pathways that link growth factor 
25 receptors to actin polymerization, which is necessary for normal cellular growth and division. The 
rab, arf, and sari families of proteins control the translocation of vesicles to and from membranes 
for protein localization, protein processing, and secretion. Ran GTP-binding proteins are located 
in the nucleus of cells and have a key role in nuclear protein import, the control of DNA synthesis, 
and cell-cycle progression (Hall, A. (1990) Science 249:635-640; Barbacid, M. (1987) Ann. Rev 
30 Biochem. 56:779-827; and Sasaki, T. and Takai, Y. (1998) Biochem. Biophys. Res. Commun. 
245:641-645). 

LMW GTP-binding proteins are GTPases which cycle between a GTP-bound active form 
and a GDP-bound inactive form. This cycle is regulated by proteins that affect GDP dissociation, 
GTP association, or the rate of GTP hydrolysis. Proteins affecting GDP association are 
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represented by guanine nucleotide dissociation inhibitors and guanine nucleotide exchange factors 
(GEP). The best characterized is the mammalian homologue of the Drosophila Son-of-Sevenless 
protein. Proteins affecting GTP hydrolysis are exemplified by GTPase-activating proteins (GAP). 
Both GEP and GAP activity may be controlled in response to extracellular stimuli and modulated 

5 by accessory proteins such as RalBPl and POB1. The GDP-bound form is converted to the 
GTP-bound form through a GDP/GTP exchange reaction facilitated by guanine 
nucleotide-releasing factors. The GTP-bound form is converted to the GDP-bound form by 
intrinsic GTPase activity, and the conversion is accelerated by GAP (Ikeda, M. et al. (1998) J. 
Biol. Chem. 273:81 4-82 l;Quilliam, L. A. (1995) Bioessays 17:395-404.). Mutant Ras-family 
10 proteins, which bind but caA not hydrolyze GTP, are permanently activated, and cause cell 

proliferation or-cancer,-as-do GEP^that activate LMW GTP-binding proteins (Drivas, G. T. et al. 

(1990) MoL Cell. Biol. 10:1793-1798; and Whitehead, I. P. et al. (1998) Mol Cell Biol. 
18:4689-4697.) 

The discovery nf new cell signaling proteins. and-the-polynucleotides-encoding them 

15 satisfies a need in the art by providing new compositions which are useful in the diagnosis, 
prevention, and treatment of cell proliferative and inflammatory disorders. 

SUMMARY OF THE INVENTION 

20 The invention features substantially purified polypeptides, cell signaling proteins, referred 

to collectively as "CSIGP" and individually as CSIGP-1 through CSIGP-13 . In one aspect, the 
invention provides a substantially purified polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-13, and fragments thereof 

The invention further provides a substantially purified variant having at least 90% amino 

25 acid identity to at least one of the amino acid sequences selected from the group consisting of SEQ 
ID NO: 1-13, and fragments thereof. The invention also provides an isolated and purified 
polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-13, and fragments thereof. The invention also includes an 
isolated and purified polynucleotide variant having at least 70% polynucleotide sequence identity 

30 to the polynucleotide encoding the polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1 - 1 3, and fragments thereof. 

Additionally, the invention provides an isolated and purified polynucleotide which 
hybridizes under stringent conditions to the polynucleotide encoding the polypeptide comprising 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-13, and fragments 

35 thereof. The invention also provides an isolated and purified polynucleotide having a sequence 
which is complementary to the polynucleotide encoding the polypeptide comprising the amino 

-5- 
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acid sequence selected from the group consisting of SEQ ID NO: I- 13, and fragments thereof. 

The invention also provides an isolated and purified polynucleotide comprising a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 14-26, and fragments 
thereof. The invention further provides an isolated and purified polynucleotide variant having at 
5 least 70% polynucleotide sequence identity to the polynucleotide sequence selected from the 
group consisting of SEQ ID NO: 14-26 and' fragments thereof. The invention also provides an 
isolated and purified polynucleotide having a sequence which is complementary to the 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ 
ID NO: 14-26 and fragments thereof. 
10 The invention also provides a method for detecting a polynucleotide in a sample 

containing nucleic acids, the method comprising the steps of (a) hybridizing the complement of the 
polynucleotide sequence to at least one of the polynucleotides of the sample, thereby forming a 
hybridization complex; and (b) detecting the hybridization complex, wherein the presence of the 
hybridization complex correlates with the presence of a polynucleotide'™ the sample. In one 
1 5 aspect, the method further comprises amplifying the polynucleotide prior to hybridization. 

The invention further provides an expression vector containing at least a fragment of the 
polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-1 3, and fragments thereof. In another aspect, the expression 
vector is contained within a host cell. 
20 The invention also provides a method for producing a polypeptide, the method comprising 

the steps of: (a) culturing the host cell containing an expression vector containing at least a 
fragment of a polynucleotide under conditions suitable for the expression of the polypeptide; and 
(b) recovering the polypeptide from the host cell culture. 

The invention also provides a pharmaceutical composition comprising a substantially 
25 purified polypeptide having the amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-13, and fragments thereof, in conjunction with a suitable pharmaceutical carrier. 

The invention further includes a purified antibody which binds to a polypeptide selected 
from the group consisting of SEQ ID NO:l-13, and fragments thereof. The invention also 
provides a purified agonist and a purified antagonist to the polypeptide. 
30 The invention also provides a method for treating or preventing a disorder associated with 

decreased expression or activity of CSIGP, the method comprising administering to a subject in 
need of such treatment an effective amount of a pharmaceutical composition comprising a 
substantially purified polypeptide having the amino acid sequence selected from the group 
consisting of SEQ ID NO: 1- 13, and fragments thereof, in conjunction with a suitable 
35 pharmaceutical carrier. 

-6- 
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I 

The invention also provides a method for treating or preventing a disorder associated with 

i 

increased expression or activity of CSIGP, the method comprising administering to a subject in 
need of such treatment an effective amount of an antagonist of a polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-13, and fragments thereof, in 

5 conjunction with a suitable pharmaceutical carrier. 

< * . 

BRIEF DESCRIPTION OF THE TABLES ' * 

Table 1 shows nucleotide and polypeptide sequence identification numbers (SEQ ID NO), 
clone identification numbers (clone ID), cDNA libraries, and cDNA fragments used to assemble 

! 

10 full-length sequences encoding CSIGP. 

Table 2 shows features of each polypeptide sequence including potential motifs, 

i . i 

homologous sequences, and methods and algorithms used for identification of CSIGP. 

Table 3 shows the tissue-specific expression patterns of each nucleic acid sequence as 
determined by northern analysis, diseases, disorders or conditions associated with these tissues, 
15 and the vector into which each cDNA was cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which Incyte 
cDNA clones encoding CSIGP were isolated. 

Table 5 shows the programs, their descriptions, references, and threshold parameters used 
to analyze CSIGP. 

i 

20 i 

DESCRIPTION OF THE INVENTION f 

Before the present proteins, nucleotide sequences, and methods are described, it is 
understood that this invention is not limited to the particular machines, materials and methods 
described, as these may vary. It is also to be understood that the terminology used herein is for the 
25 purpose of describing particular embodiments only, and is not intended to limit the scope of the 
present invention which will be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," 
"an," and "the" include plural reference unless the context clearly dictates otherwise. Thus, for 
example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an 
30 antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled 
in the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although any machines, materials, and methods similar or equivalent to those described 
35 herein can be used to practice or test the present invention, the preferred machines, materials and 

-7- 
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methods are now described. All publications mentioned herein are cited for the purpose of 
describing and disclosing the cell lines, protocols, reagents and vectors which are reported in the 
publications and which might be used in connection with the invention. Nothing herein is to be 
construed as an admission that the invention is not entitled to antedate such disclosure by virtue of 
5 prior invention. 
DEFINITIONS 

"CSIGP" refers to the amino acid sequences of substantially purified CSIGP obtained 
from any species, particularly a mammalian species, including bovine, ovine, porcine, murine, 
equine, and preferably the human species, from any source, whether natural, synthetic, 
10 semi-synthetic, or recombinant ' 

The term "agonist" refers to a molecule which, when bound to CSIGP, increases or 

prolongs the duration of the effect of CSIGP. Agonists may include proteins, nucleic acids, 
carbohydrates, or any other molecules which bind to and modulate the effect of CSIGP. 

An " a,,el »c variant" is analternative form of the gene encoding CSIGP. Allelic variants 

15 may result from at least one mutation in the nucleic acid sequence^nd may result in altered 
mRNAs or in polypeptides whose structure or function may or may not be altered. Any given 
natural or recombinant gene may have none, one, or many allelic forms. Common mutational 
changes which give rise to allelic variants are generally ascribed to natural deletions, additions, or 
substitutions of nucleotides. Each of these types of changes may occur alone, or in combination 
20 with the others, one or more times in a given sequence. 

"Altered" nucleic acid sequences encoding CSIGP include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polynucleotide the same as 
CSIGP or a polypeptide with at least one functional characteristic of CSIGP. Included within this 
definition are polymorphisms which may or may not be readily detectable using a particular 
25 oligonucleotide probe of the polynucleotide encoding CSIGP, and improper or unexpected 
hybridization to allelic variants, with a locus other than the normal chromosomal locus for the 
polynucleotide sequence encoding CSIGP. The encoded protein may also be "altered," and may 
contain deletions, insertions, or substitutions of amino acid residues which produce a silent change 
and result in a functionally equivalent CSIGP. Deliberate amino acid substitutions may be made 
30 on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the 
amphipathic nature of the residues, as long as the biological or immunological activity of CSIGP is 
retained. For example, negatively charged amino acids may include aspartic acid and glutamic 
acid, positively charged amino acids may include lysine and arginine, and amino acids with 
uncharged polar head groups having similar hydrophilicity values may include leucine, isoleucine, 
35 and valine; glycine and alanine; asparagine and glutamine: serine and threonine; and 
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phenylalanine and tyrosine. 

The terms "amino acid" or "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally .occurring or 
synthetic molecules. In this context, "fragments," "immunogenic fragments," or "antigenic 
5 fragments" refer to fragments of CSIGP which are preferably at least 5 to about 1 5 amino acids in 
length, most preferably at least 14 amino acids, and which retain some biological activity or 
immunological activity of CSIGP. Where "amino acid sequence" is recited to refer to an amino 
acid sequence of a naturally occurring protein molecule, "amino acid sequence" and like terms are 
not meant to limit the amino acid sequence to the complete native amino acid sequence associated 
10 with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art. > 

The term "antagonist" refers to a molecule which, when bound to CSIGP, decreases the 
15 amount or the duration of the effect of the biological or immunological activity of CSIGP. 

Antagonists may include proteins, nucleic acids, carbohydrates, antibodies, or any other molecules 
which decrease the effect of CSIGP. 

The term "antibody" refers to intact molecules as well as to fragments thereof, such as 
Fab, FCab*)?, and Fv fragments, which are capable of binding the epitopic determinant. Antibodies 
20 that bind CSIGP polypeptides can be prepared using intact polypeptides or using fragments 

containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide 
used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of 
RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly 
used carriers that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, 
25 and keyhole limpet hemocyanin (KJLH). The coupled peptide is then used to immunize the animal. 

The term "antigenic determinant" refers to that fragment of a molecule (i.e., an epitope) 
that makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (given regions or three-dimensional structures on 
30 the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition containing a nucleic acid sequence which 
is complementary to the "sense" strand of a specific nucleic acid sequence. Antisense molecules 
may be produced by any method including synthesis or transcription. Once introduced into a cell, 
35 the complementary nucleotides combine with natural sequences produced by the cell to form 
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duplexes and to block either transcription or translation. The designation "negative" can refer to 
the antisense strand, and the designation "positive" can refer to the sense strand. 

The term "biologically active," refers to a protein having structural, regulatory, or 
biochemical functions of a naturally occurring molecule. Likewise, "immunologically active** 
5 refers to the capability of the natural, recombinant, or synthetic CSIGP, or of any oligopeptide 
thereof, to induce a specific immune response in appropriate animals or cells and to bind with 
specific antibodies. 1 . 

The terms "complementary" or "complementarity" refer to the natural binding of 
polynucleotides by base pairing. For example, the sequence "5' A-G-T 3"' bonds to the 
10 complementary sequence '"3/ T-C-A 5'." Complementarity between two single-stranded molecules 
may be "partial," such that only some of the nucleic acids bind, or it may be "complete," such that 
total complementarity exists between the single stranded molecules. The degree of 
complementarity between nucleic acid strands has significant effects on the efficiency and strength 
of the hybridization between the nucleic acid strands. This is of particular importance in 
15 amplification reactions, which depend upon binding between nucleic acids strands, and in the 
design and use of peptide nucleic acid (PNA) molecules. 

A "composition comprising a given polynucleotide sequence" or a "composition 
comprising a given amino acid sequence" refer broadly to any composition containing the given 
polynucleotide or amino acid sequence. The composition may comprise a dry formulation or an 
20 aqueous solution. Compositions comprising polynucleotide sequences encoding CSIGP or 
fragments of CSIGP may be employed as hybridization probes. The probes may be stored, in 
freeze-dried form and may be associated with a stabilizing agent such as a carbohydrate. In 
hybridizations, the probe may be deployed in an aqueous solution containing salts (e.g., NaCI), 
detergents (e.g., sodium dodecyl sulfate; SDS), and other components (e.g., Denhardfs solution, 
25 dry milk, salmon sperm DNA, etc.). 

"Consensus sequence"refers to a nucleic acid sequence which has been resequenced to 
resolve uncalled bases, extended using XL-PCR kit (Perkin-Elroer, Norwalk CT) in the 5' and/or 
the 3' direction, and resequenced, or which has been assembled from the overlapping sequences of 
more than one Incyte Clone using a computer program for fragment assembly, such as the 
30 GELVIEW Fragment Assembly system (GCG, Madison WI). Some sequences have been both 
extended and assembled to produce the consensus sequence. 

The term "correlates with expression of a polynucleotide" indicates that the detection of 
the presence of nucleic acids, the same or related to a nucleic acid sequence encoding CSIGP, by 
northern analysis is indicative of the presence of nucleic acids encoding CSIGP in a sample, and 
35 thereby correlates with expression of the transcript from the polynucleotide encoding CSIGP. 
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A "deletion'Vefers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to the chemical modification of a polypeptide sequence, or a 
polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for 
5 example, replacement of hydrogen by an alkyl, acyl, or amino group. A derivative polynucleotide 
encodes a polypeptide which retains at least one biological or immunological function of the 
natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or any 
similar process that retains at least one biological or immunological function of the polypeptide 
from which it was derived f . 

10 The term "similarity" refers to a degree, of complementarity. There may be partial 

similarity or complete similarity. The word "identity" may substitute for the word "similarity." A 
partially complementary sequence that at least partially inhibits an identical sequence from 
hybridizing to a target nucleic acid is referred to as "substantially similar." The inhibition of 
hybridization of the completely complementary sequence to the target sequence may be examined 

15 using a hybridization assay (Southern or northern blot, solution hybridization, and the like) under 
conditions of reduced stringency. A substantially similar sequence or hybridization probe will 
compete for and inhibit the binding of a completely similar (identical) sequence to the target 
sequence under conditions of reduced stringency. This is not to say that conditions of reduced 
stringency are such that non-specific binding is perrriitted, as reduced stringency conditions 

20 require that the binding of two sequences to one another be a specific (i.e., a selective) interaction. 
The absence of non-specific binding may be tested by the use of a second target sequence Which 
lacks even a partial degree of complementarity (e.g., less than about 30% similarity or identity). 
In the absence of non-specific binding, the substantially similar sequence or probe will not 
hybridize to the second non-complementary target sequence. 

25 The phrases "percent identity" or "% identity" refer to the percentage of sequence 

similarity found in a comparison of two or more amino acid or nucleic acid sequences. Percent 
identity can be determined electronically, e.g., by using the MEGALIGN program (DNASTAR, 
Madison WI). The MEGALIGN program can create alignments between two or more sequences 
according to different methods, e.g., the clustal method. (See, e.g., Higgins, D.G. and P.M. Sharp 

30 (1988) Gene 73:237-244.) The clustal algorithm groups sequences into clusters by examining the 
distances between all pairs. The clusters are aligned pairwise and then in groups. The percentage 
similarity between two amino acid sequences, e.g., sequence A and sequence B, is calculated by 
dividing the length of sequence A, minus the number of gap residues in sequence A, minus the 
number of gap residues in sequence B, into the sum of the residue matches between sequence A 

35 and sequence B, times one hundred. Gaps of low or of no similarity between the two amino acid 
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sequences are not included in determining percentage similarity. Percent identity between nucleic 
acid sequences can also be counted or calculated by other methods known in the art, e.g., the Jotun 
.Hein method. (See, e.g., Hein, J. (1990) Methods Enzymoi. 183:626-645.) Identity between 
sequences can also be determined by other methods known in the art, e.g., by varying 
hybridization conditions. 

i 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may 
contain DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements 
required for stable mitotic chromosome segregation and maintenance. 

The term "humanized antibody" refers to antibody molecules in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization*' refers to any process by which a strand of nucleic acid binds with a 
complementary strand through base pairing. 

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., C 0 t or Rot analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a 
solid support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate 
substrate to which cells or their nucleic acids have been fixed). 

The words "insertion" or f "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively, 
to the sequence found in the naturally occurring molecule. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by 
expression of various factors, e.g., cytokines, chemokines, and other signaling molecules, which 
may affect cellular and systemic defense systems. 

The term "microarray" refers to an arrangement of distinct polynucleotides on a substrate. 
The terms "element" or "array element" in a microarray context, refer to hybridizable 
polynucleotides arranged on the surface of a substrate. 

The term "modulate" refers to a change in the activity of CSIGP. For example, 
modulation may cause an increase or a decrease in protein activity, binding characteristics, or any 
other biological, functional, or immunological properties of CSIGP. 

The phrases "nucleic acid" or "nucleic acid sequence" refer to a nucleotide, 
oligonucleotide, polynucleotide, or any fragment thereof. These phrases also refer to DNA or 
RNA of genomic or synthetic origin which may be single-stranded or double-stranded and may 
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represent the sense or the antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or 
RNA-like material. In this context, "fragments" refers to those nucleic acid sequences which, 
when translated, would produce polypeptides retaining some functional characteristic, e.g., 
antigenicity, or structural domain characteristic, e.g., ATP-binding site, of the full-length 
5 polypeptide. 

The terms "operably associated" or "operably linked" refer to functionally related nucleic 
acid sequences. A promoter is operably associated or operably linked with a coding sequence if 
the promoter controls the translation of the encoded polypeptide. While operably associated or 
operably linked nucleic acid sequences can be contiguous and in the same reading frame, certain 
10 genetic elements, e.g., repressor genes, are not contiguously linked to the sequence encoding the 
polypeptide but^still-bind-tcM3peratQr-sequences4hat control exp ressio n of th e pol ypeptide. 

The term "oligonucleotide" refers to a nucleic acid sequence of at least about 6 
nucleotides to 60 nucleotides, preferably about IS to 30 nucleotides, and most preferably about 20 
to 25 nucleotides, which can bejused in FCR amplification o r in a hy bridization assay or 
15 microarray. "Oligonucleotide" is substantially equivalent to the terms "amplimer," "primer," 
"oligomer," and "probe," as these terms are commonly defined in the art. ' 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone 
of amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
20 PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
acids encoding CSIGP, or fragments thereof, or CSIGP itself, may comprise a bodily fluid; an 
extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic 
25 DNA, RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" or "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, or an antagonist. The interaction is dependent upon 
the presence of a particular structure of the protein, e.g., the antigenic determinant or epitope, 
recognized by the binding molecule. For example, if an antibody is specific for epitope "A," the 
30 presence of a polypeptide containing the epitope A, or the presence of free unlabeled A, in a 

reaction containing free labeled A and the antibody will reduce the amount of labeled A that binds 
to the antibody. 

The term "stringent conditions" refers to conditions which permit hybridization between 
polynucleotides and the claimed polynucleotides. Stringent conditions can be defined by salt 
35 concentration, the concentration of organic solvent, e.g., formamide, temperature, and other 
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conditions well known in the art. In particular, stringency can be increased by reducing the 
concentration of salt, increasing the concentration of formamide, or raising the hybridization 
temperature. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
5 removed from their natural environment and are isolated or separated, and are at least about 60% 
free, preferably about 75% free, and most preferably about 90% free from other components with 

which they are naturally associated. ' i 

f 

A "substitution" refers to the replacement of on'e or more amino acids or nucleotides by 
different amino acids or nucleotides, respectively. 
10 "Substrate" refers to ( any suitable rigid or semi-rigid support including membranes, filters, 

chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

"Transformation" describes a process by which exogenous DNA enters and changes a 

15 recipient cell. Transformation may occur under natural or artificial conditions according to 1 
various methods well known in the art, and may rely on any known method for the insertion of 
foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method for 
transformation is selected based on the type of host cell being transformed and may include, but is 
not limited to, viral infection, electroporation, heat shock, lipofection, and particle bombardment. 

20 The term "transformed" cells includes stably transformed celjs in which the inserted DNA is 
capable of replication either as an autonomously replicating plasmid or as part of the host 
chromosome as well as transiently transformed cells which express the inserted DNA or RNA for 
limited periods of time. 

A "variant" of CSIGP polypeptides refers to an amino acid sequence that is altered by one 
25 or more amino acid residues. The variant may have "conservative" changes, wherein a substituted 
amino acid has similar structural or chemical properties (e.g., replacement of leucine with 
isoleucine). More rarely, a variant may have "nonconservative" changes (e.g., replacement of 
glycine with tryptophan). Analogous minor variations may also include amino acid deletions or 
insertions, or both. Guidance in determining which amino acid residues may be substituted, 
30 inserted, or deleted without abolishing biological or immunological activity may be found using 
computer programs well known in the art, for example, LASERGENE software (DNASTAR). 

The term "variant," when used in the context of a polynucleotide sequence, may 
encompass a polynucleotide sequence related to CSIGP. This definition may also include, for 
example, "allelic" (as defined above), "splice," "species," or "polymorphic" variants. A splice 
35 variant may have significant identity to a reference molecule, but will generally have a greater or 
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lesser number of polynucleotides due to alternate splicing of exons during mRN A processing. The 
corresponding polypeptide may possess additional functional domains or an absence of domains. 
Species variants are polynucleotide sequences that vary from one species to another. The resulting 
polypeptides generally will have significant amino acid identity relative to each other. A 
5 polymorphic variant is a variation in the polynucleotide sequence of a particular gene between 
individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one base. The presence 
of SNPs may be indicative of, for example, a certain population, a disease state, or a propensity for 
a disease state. , 
10 THE INVENTION 1 , 

The invention is based on the discovery of new human cell signaling proteins (CSIGP), 
the polynucleotides encoding CSIGP, and the use of these compositions for the diagnosis, 

i 

treatment, or prevention of cell proliferative and inflammatory disorders. 

Table 1 lists the Incyte Clones used to derive full length nucleotide sequences encoding 

15 CSIGP. Columns 1 and 2 show the sequence identification numbers (SEQ ID NO) of the amino 
acid and nucleic acid sequences, respectively. Column 3 shows the Clone ID of the Incyte Clone 
in which nucleic acids encoding each CSIGP were first identified, and column 4, the cDNA 
libraries from which these clones were isolated. Column 5 shows Incyte clones, their 
corresponding cDNA libraries, and shotgun sequences useful as fragments in hybridization 

20 technologies, and which are part of the consensus nucleotide sequence of each CSIGP. 

The columns of Table 2 show various properties of the polypeptides of the invention: 
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3, potential phosphorylation sites; column 4, potential glycosylation sites; 
column 5, the amino acid residues comprising signature sequences and motifs; column 6, 

25 homologous sequences; and column 7, analytical methods used to identify each protein through 
sequence homology and protein motifs. 

The columns of Table 3 show the tissue-specificity and disease-association of nucleotide 
sequences encoding CSIGP. The first column of Table 3 lists the polynucleotide sequence 
identifiers. The second column lists tissue categories which express CSIGP as a fraction of total 

30 tissue categories expressing CSIGP. The third column lists diseases, disorders, and conditiond 
associated with those tissues expressing CSIGP. The fourth column lists the vectors used to 
subclone the cDNA library. 

The following fragments of the nucleotide sequences encoding CSIGP are useful in 
hybridization or amplification technologies to identify SEQ ID NO: 14-26 and to distinguish 

35 between SEQ ID NO: 14-26 and similar polynucleotide sequences. The useful fragments are the 
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fragment of SEQ ID NO: 14 from about nucleotide 135 to about nucleotide 189, the 
fragment of SEQ ID NO: 1 5 from about nucleotide 493 to about nucleotide 558, the 
fragment of SEQ ID NO: 16 from about nucleotide 1 170 to about nucleotide 1233, the 
fragment of SEQ ID NO: 17 from about nucleotide 939 to about nucleotide 996, the 
5 fragment of SEQ ID NO: 1 8 from about nucleotide 424 to about nucleotide 486, the 
fragment of SEQ ID NO: 19 from about nucleotide 274 to about nucleotide 333, and the 
fragment of SEQ ID NO:20 from about nucleotide 1013 to about nucleotide 1070, the 
fragment of SEQ ID NO:21 from about nucleotide 284 to, about nucleotide 325, the fragment of 
SEQ ID NO:22 from about nucleotide! 642 to about hucleotide 674, the fragment of SEQ ID 
10 NO:23 from about nucleotide 742 to about nucleotide 769, the fragment of SEQ ID NO:24 from 
about nucleotides^ fragment of SEQTDT^O:25^fronTabout 

nucleotide 205 to about nucleotide 246, and the fragment of SEQ ID NO:26 from about nucleotide 
3 1 9 to about nucleotide 342. > 

The invention also encompass e s CSIGP v a riants. A preferred CS IGPvariant is one which 
15 has at least about 80%, more preferably at least about 90%, and most preferably at least about 95% 
amino acid sequence identity to the CSIGP amino acid sequence, and which contains at least one 
functional or structural characteristic of CSIGP. 

The invention also encompasses polynucleotides which encode CSIGP. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence 
20 selected from the group consisting of SEQ ID NO: 1 4-26 which encodes CSIGP. 

The invention also encompasses a variant of a polynucleotide sequence encoding CSIGP. 
In particular, such a variant polynucleotide sequence will have at least about 70%, more preferably 
at least about 85%, and most preferably at least about 95% polynucleotide sequence identity to the 
polynucleotide sequence encoding CSIGP. A particular aspect of the invention encompasses a 
25 variant of a polynucleotide sequence comprising a sequence selected from the group consisting of 
SEQ ID NO: 14-26 which has at least about 70%, more preferably at least about 85%, and most 
preferably at least about 95% polynucleotide sequence identity to a nucleic acid sequence selected 
from the group consisting of SEQ ID NO: 14-26. Any one of the polynucleotide variants described 
above can encode an amino acid sequence which contains at least one functional or structural 
30 characteristic of CSIGP 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding CSIGP, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
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sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring CSIGP, and all such variations are to be 
considered as being specifically disclosed. 
5 Although nucleotide sequences which encode CSIGP and its variants are preferably 

capable of hybridizing to the nucleotide sequence of the naturally occurring CSIGP under 
appropriately selected conditions of stringency, it may be advantageous to produce nucleotide 
sequences encoding CSIGP or its derivatives possessing a substantially different codon usage, 
e.g., inclusion of non-naturally occurring codons. Codons may be selected to increase the rate at 

10 which expression of the peptide occurs in a particular prokaryotic or eukaryotic host in accordance 
with the frequency with which particular codons are utilized by the host. Other reasons for 
substantially altering the nucleotide sequence encoding CSIGP and its derivatives without altering 
the encoded amino acid sequences include the production of RNA transcripts having more 
desirable properties, such as a greater half-life, than transcripts produced from the naturally 

15 occurring sequence. 

The invention also encompasses production of DNA sequences which encode CSIGP and 
CSIGP derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 

20 introduce mutations into a sequence encoding CSIGP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO: 14-26 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. 
and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 

25 1 52:507-5 1 1 .) For example, stringent salt concentration will ordinarily be less than about 750 mM 
NaCI and 75 mM trisodium citrate, preferably less than about 500 mM NaCl and 50 mM trisodium 
citrate, and most preferably less than about 250 mM NaCl and 25 mM trisodium citrate. Low 
stringency hybridization can be obtained in the absence of organic solvent, e.g., form amide, while 
high stringency hybridization can be obtained in the presence of at least about 35% formamide, 

30 and most preferably at least about 50% formamide. Stringent temperature conditions will 

ordinarily include temperatures of at least about 30°C, more preferably of at least about 37°C, and 
most preferably of at least about 42°C. Varying additional parameters, such as hybridization time, 
the concentration of detergent, e.g., sodium dodecyl sulfate (SDS), and the inclusion or exclusion 
of carrier DNA, are well known to those skilled in the art. Various levels of stringency are 

35 accomplished by combining these various conditions as needed. In a preferred embodiment, 
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hybridization will occur at 30°C in 750 mM NaCl, 75 mM trisodium citrate, and 1% SDS. In a 
more preferred embodiment, hybridization will occur at 37°C in 500 mM Nad, 50 mM trisodium 
citrate, 1% SDS, 35% formamide, and 100 jig/ml denatured salmon sperm DNA (ssDNA). In a 
most preferred embodiment, hybridization will occur at 42°C in 250 mM NaCl, 25 mM trisodium 
5 citrate, 1% SDS, 50 % formamide, and 200 ^g/ml ssDNA. Usefiil variations on these conditions 
will be readily apparent to those skilled in the art. 

The washing steps which follow hybridization can also vary in stringency. Wash 
stringency conditions can be defined by salt concentration and by temperature. As above, wash 
stringency can be increased by decreasing salt concentration or by increasing temperature. For 

10 example, stringent salt concentration for the wash steps will preferably be less than about 30 mM 
NaCl and 3 mM trisodium citrate, and most preferably less than about 15 mM NaCl and 1 .5 mM 
trisodium citrate. Stringent temperature conditions for the wash steps will ordinarily include 
temperature of at least about 25°C, more preferably of at least about 42°C, and most preferably of 
at least about 68°C. In a preferred embodiment, wash steps will occur at 25°C in 30 mM NaCl, 3 

15 mM trisodium citrate, and 0.1% SDS. In a more preferred embodiment, wash steps will occur at 
42°C in 15 mM NaCl, 1 .5 mM trisodium. citrate, and 0.1% SDS. In a most preferred embodiment, 
wash steps will occur at 68°C in 1 5 mM NaCl, 1 .5 mM trisodium citrate, and 0. 1 % SDS. 
Additional variations on these conditions will be readily apparent to those skilled in the art. 

Methods for DNA sequencing are well known in the art and may be used to practice any 

i 

20 of the embodiments of the invention. The methods may employ such enzymes as the Klenow 

fragment of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase 
(Perkin-Elmer), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 
combinations of polymerases and proofreading exonucleases such as those found in the 
ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence 

25 preparation is automated with machines such as the HYDRA microdispenser (Robbins Scientific, 
Sunnyvale CA), MICROLAB 2200 (Hamilton, Reno NV), Peltier Thermal Cycler 200 (PTC200; 
MJ Research, Watertown MA) and the ABI CATALYST 800 (Perkin-Elmer). Sequencing is then 
carried out using either ABI 373 or 377 DNA Sequencing Systems (Perkin-Elmer) or the 
MEGABACE 1000 DNA sequencing system (Molecular Dynamics, Sunnyvale CA). The 

30 resulting sequences are analyzed using a variety of algorithms which are well known in the art. 
(See, e.g., Ausubel, F.M. (1997) Short Protocols in Molecular Biology . John Wiley & Sons, New 
York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biology and Biotechnology , Wiley VCH, New 
York NY, pp. 856-853.) 

The nucleic acid sequences encoding CSIGP may be extended utilizing a partial 

35 nucleotide sequence and employing various PCR-based methods known in the art to detect 
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upstream sequences, such as promoters and regulatory elements. For example, one method which 
may be employed, restriction-site PCR, uses universal and nested primers to amplify unknown 
sequence from genomic DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods 
Applic. 2:318-322.) Another method, inverse PCR, uses primers that extend in divergent 
5 directions to amplify unknown sequence from a circularized template. The template is derived 
from restriction fragments comprising a known genomic locus and surrounding sequences. (See, 
e.g., Triglia, T. et ah ( 1 988) Nucleic Acids Res. 1 6:8 1 86.) A third method, capture PCR, involves 
PCR amplification of DNA fragments adjacent to known sequences in human and yeast artificial 
chromosome DNA. (See, e.g., Lagerstrom, M. et al. (1991) PCR Methods Applic. 1:1 1 1-119.) In 

10 this method; multiple restriction enzyme, digestions and ligations may be used to insert an 

engineered double-stranded-seq uencennt^ ing PCR. 

Other methods which may be used to retrieve unknown sequences are known in the art (See, e.g., 
Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-306). Additionally, one may use PCR, 
nested primers, andPROMOTEREINDERHbiranes (Cto^ genomic 

15 DNA. This procedure avoids the need to screen libraries and is useful in finding intron/exon 
junctions. For all PCR-based methods, primers may be designed using commercially available 
software, such as OLIGO 4.06 Primer Analysis software (National Biosciences, Plymouth MN) or 
another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of 
about 50% or more, and to anneal to the template at, temperatures of about 68°C to 72°C. 

20 When screening for fiill-length cDNAs, it is preferable to use libraries that have been 

size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5 1 regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of 
sequence into 5' non-transcribed regulatory regions. 

25 Capillary electrophoresis systems which are commercially available may be used to 

analyze the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, 
capillary sequencing may employ flowable polymers for electrophoretic separation, four different 
nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for 
detection of the emitted wavelengths. Output/light intensity may be converted to electrical signal 

30 using appropriate software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Perkin-Elmer), 
and the entire process from loading of samples to computer analysis and electronic data display 
may be computer controlled. Capillary electrophoresis is especially preferable for sequencing 
small DNA fragments which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof 

35 which encode CSIGP may be cloned in recombinant DNA molecules that direct expression of 
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CSIGP, or fragments or functional equivalents thereof, in appropriate host cells. Due to the 
inherent degeneracy of the genetic code,* other DNA sequences which encode substantially the 
same or a functionally equivalent amino acid sequence may be produced and used to express 
CSIGP. 

5 The nucleotide sequences of the present invention can be engineered using methods 

generally known in the art in order to alter CSIGP-ericoding sequences for a variety of purposes 
including, but not limited to, modification of the cloning, processing, and/or expression of the 
gene product. DNA shuffling by random fragmentation and PCR reassembly of gene fragments 

and synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example, 

i 

10 oligonucleotide-mediated site-directeci mutagenesis may be used to introduce mutations that create 
new restriction sites, alter glycosylation patterns, change codon preference, produce splice 
variants, and so forth. 

In another embodiment, sequences encoding CSIGP may be synthesized, in whole or in 
part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucl. 

15 Acids Res. Symp. Sen 215-223, and Horn, T. et al. (1980) Nucl. Acids Res. Symp. Ser. 225-232.) 
Alternatively, CSIGP itself or a fragment thereof may be synthesized using chemical methods. 
For example, peptide synthesis can be performed using various solid-phase techniques. (See, e.g., 
Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved using 
the ABI 431 A Peptide Synthesizer (Perkin-Elmer). Additionally, the amino acid sequence of 

20 CSIGP, or any part thereof, may Jbe altered during direct synthesis and/or combined with 
sequences from other proteins, or any part thereof, to produce a variant polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g, Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392- 
421 .) The composition of the synthetic peptides may be confirmed by amino acid analysis or by 

25 sequencing. (See, e.g., Creighton, T. (1984) Proteins. Structures and Molecular Properties , WH 
Freeman, New York NY.) 

In order to express a biologically active CSIGP, the nucleotide sequences encoding 
CSIGP or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector 
which contains the necessary elements for transcriptional and translational control of the inserted 

30 coding sequence in a suitable host. These elements include regulatory sequences, such as 

enhancers, constitutive and inducible promoters, and 5' and 3* untranslated regions in the vector 
and in polynucleotide sequences encoding CSIGP. Such elements may vary in their strength and 
specificity. Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding CSIGP. Such signals include the ATG initiation codon and adjacent 

35 sequences, e.g. the Kozak sequence. In cases where sequences encoding CSIGP and its initiation 
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codon and upstream regulatory sequences are inserted into the appropriate expression vector, no 

4 

additional transcriptional or transiational control signals may be needed. However, in cases where 
only coding sequence, or a fragment thereof, is inserted, exogenous transiational, control signals 
including an in-frame ATG initiation codon should be provided by the vector. Exogenous 
5 transiational elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for the 
particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
20:125-162.) 

i 

Methods which are well known to those skilled in the art may be used to construct 

10 expression vectors containing sequences encoding CSIGP and appropriate transcriptional and 

— -^nslational contra l^lements^-These-m techniques, 
synthetic techniques, and jnvivo genetic recombination. (See, e.g., Sambrook, J. et ai. (1989) 
Molecular Cloning. A Laboratory Manual . Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 
— 16J7; Ausubel, F.M. et al. g^95X jCurrent-Protocolsjn-Molecu4ar^Biologv . John Wiley & Sons, 

1 5 New York NY, ch. 9, 1 3, and 1 6.) 

A variety of expression vector/host systems may be utilized to contain and express 
sequences encoding CSIGP. These include, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage, piasmid, or cosmid DNA expression 
vectors; yeast transformed with yeast expression vectors; insect cell systems infected with viral 

20 expression vectors (e.g., baculovirus); plant cell systems transformed with viral expression vectors 
(e.g., cauliflower mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial 
expression vectors (e.g., Ti or pBR322 piasmids); or animal cell systems. The invention is not 
limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected 

25 depending upon the use intended for polynucleotide sequences encoding CSIGP. For example, 
routine cloning, subcloning, and propagation of polynucleotide sequences encoding CSIGP can be 
achieved using a multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) 
or pSPORTl piasmid (Life Technologies). Ligation of sequences encoding CSIGP into the 
vector's multiple cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure 

30 for identification of transformed bacteria containing recombinant molecules. In addition, these 
vectors may be useful for ULvitro transcription, dideoxy sequencing, single strand rescue with 
helper phage, and creation of nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. 
and S.M. Schuster ( 1 989) J. Biol. Chem. 264:5503-5509.) When large quantities of CSIGP are 
needed, e.g. for the production of antibodies, vectors which direct high level expression of CSIGP 

35 may be used. For example, vectors containing the strong, inducible T5 or T7 bacteriophage 
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promoter may be used. 

Yeast expression systems may be used for production of CS1GP. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH, 
may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such vectors 
5 direct either the secretion or intracellular retention of expressed proteins and enable integration of 
foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995. supra : 
Grant et al. (1?87) Methods Enzymol. 153:516-'54; and Scorer, C. A. et al. (1994) Bio/Technology 
12:181-184.) 

Plant systems may also be used for expression of CSIGP. Transcription of sequences 
10 encoding CSIGP may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) 
EMBO J. 6:307-31 1). Alternatively, plant promoters such as the small subunit of RUBISCO or 

heat shock promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; 

i 

Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell 
15 Differ. 17:85-105.) These constructs can be introduced into plant cells by direct DNA 

transformation or pathogen-mediated transfection. (See, e.g.. The McGraw Hill Yearbook of 
Science and Technology (1992) McGraw Hill, New York NY, pp. 191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 

where an adenovirus is used as an expression vector, sequences encoding CSIGP may be ligated 

i 

20 into an adenovirus transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to 
obtain infective virus which expresses CSIGP in host cells. (See, e.g., Logan, J. and T. Shenk 
(1984) Proc. Natl. Acad. Sci. 81:3655-3659.) In addition, transcription enhancers, such as the 
Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells. 

25 SV40 or EBV-based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments 
of DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb 
are constructed and delivered via conventional delivery methods (liposomes, poiycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J. J. et al. (1997) Nat Genet. 

30 15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable 
expression of CSIGP in cell lines is preferred. For example, sequences encoding CSIGP can be 
transformed into cell lines using expression vectors which may contain viral origins of replication 
and/or endogenous expression elements and a selectable marker gene on the same or on a separate 
35 vector. Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 
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days in enriched media before being switched to selective media. The purpose of the selectable 
marker is to confer resistance to a selective agent, and its presence allows growth and recovery of 
cells which successfully express the introduced sequences. Resistant clones of stably transformed 
cells may be propagated using tissue culture techniques appropriate to the cell type. 
5 Any number of selection systems may be used to recover transformed cell lines. These 

include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tk or apr cells, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cell 1 1:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, 
antibiotic, or herbicide resistance can be used as thejbasis for selection. For example, dhfr confers 

10 resistance to methotrexate; neo confers resistance to the aminoglycosides neomycin and G-41 8; 
and als ox pat confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, 
respectively. (See, e.g., Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-3570; 
Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14) Additional selectable genes have been 
described, e.g., trpB and MsD 9 which alter cellular requirements for metabolites. (See, e.g., 

15 Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. 85:8047-805 1.) Visible markers, 
e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), fl glucuronidase and its substrate 
B-glucuronide, or luciferase and its substrate luciferin may be used. These markers can be used 
not only to identify transformants, but also to quantify the amount of transient or stable protein 
expression attributable to a specific vector system. (See, e.g., Rhodes, C.A. (1995) Methods Mol. 

20 Biol. 55:121-131.) ! 

Although the presence/absence of marker gene expression suggests that the gene of 
interest is also present, the presence and expression of the gene may need to be confirmed. For 
example, if the sequence encoding CSIGP is inserted within a marker gene sequence, transformed 
cells containing sequences encoding CSIGP can be identified by the absence of marker gene 

25 function. Alternatively, a marker gene can be placed in tandem with a sequence encoding CSIGP 
under the control of a single promoter. Expression of the marker gene in response to induction or 
selection usually indicates expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding CSIGP and that 
express CSIGP may be identified by a variety of procedures known to those of skill in the art 

30 These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, 
solution, or chip based technologies for the detection and/or quantification of nucleic acid or 
protein sequences. 

Immunological methods for detecting and measuring the expression of CSIGP using either 
35 specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
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include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
fluorescence activated' cell sorting (FACS). A two-site, monoclonal- based immunoassay utilizing 
monoclonal antibodies reactive to two non- interfering epitopes on CSIGP is preferred, but a 
competitive binding assay may be employed. These and other assays are well known in the art 
5 (See, e.g., Hampton, R. et al. ( 1 990) Serological Methods, a Laboratory Manual . APS Press, St 
Paul MN, Sect IV; Coligan, J. E. et al.'(1997) Current Protocols in Immunology . Greene Pub. 
Associates and Wiley-Interscience, New York htY; and Pound, J.D. (1998) Immunochemical 
Protocols.Humana Press, Totowa NJ). ' 

A wide variety of labels and conjugation techniques are known by those skilled in the art 

10 and may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding CSIGP 
include oiigolabeling, nick translation, end-labeling, or PCR amplification using a labeled 
nucleotide. Alternatively, the sequences encoding CSIGP, or any fragments thereof, may be 
cloned into a vector for the production of an mRNA probe. Such vectors are known in the art, are 

15 commercially available, and may be used to synthesize RNA probes in vitro by addition of an 
appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures 
may be conducted using a variety of commercially available kits, such as those provided by 
Amersham Pharmacia Biotech, Promega (Madison WI), and US Biochemical. Suitable reporter 

molecules or labels which may be used for ease of detection include radionuclides, enzymes, 

t 

20 fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, 
magnetic particles, and the like. I 

Host cells transformed with nucleotide sequences encoding CSIGP may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellularly depending on the 

25 sequence and/or the vector used. As will be understood by those of skill in the art, expression 
vectors containing polynucleotides which encode CSIGP may be designed to contain signal 
sequences which direct secretion of CSIGP through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications 

30 of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 

phosphorylation, lipidation, and acyiation. Post-transiational processing which cleaves a "prepro" 
form of the protein may also be used to specify protein targeting, folding, and/or activity. 
Different host cells which have specific cellular machinery and characteristic mechanisms for 
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38), are available from 

35 the American Type Culture Collection (ATCC, Bethesda MD) and may be chosen to ensure the 
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correct modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding CSIGP may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chimeric CSIGP protein 
5 containing a heterologous moiety that can be recognized by a commercially available antibody 
may facilitate the screening of peptide libraries for inhibitors of CSIGP activity. Heterologous 
protein and peptide 'moieties may also facilitate purification effusion proteins using commercially 
available affinity matrices. Such moieties: include, but are not limited to, glutathione S-transferase 
(GST), maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6- 
10 His, FLAG, c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification 
of their cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, 
calmodulin, and metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable 
immunoaffinity purification of fusion proteins using commercially available monoclonal and 
polyclonal antibodies that specifically recognize these epitope tags. A fusion protein may also be 

1 5 engineered to contain a proteolytic cleavage site located between the CSIGP encoding sequence 
and the heterologous protein sequence, so that CSIGP may be cleaved away from the heterologous 
moiety following purification. Methods for fusion protein expression and purification are 
discussed in Ausubel (1995, supra, ch 10). A variety of commercially available kits may also be 
used to facilitate expression and purification of fusion proteins. 

20 In a further embodiment of the invention, synthesis of radiolabeled CSIGP may be 

achieved in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract systems 1 
(Promega). These systems couple transcription and translation of protein-coding sequences 
operably associated with the T7, T3, or SP6 promoters. Translation takes place in the presence of 
a radiolabeled amino acid precursor, preferably 35 S-methionine. 

25 Fragments of CSIGP may be produced not only by recombinant production, but also by 

direct peptide synthesis using solid-phase techniques. (See, e.g., Creighton, supra pp. 55-60.) 
Protein synthesis may be performed by manual techniques or by automation. Automated synthesis 
may be achieved, for example, using the ABI 431 A Peptide Synthesizer (Perkin-Elmer). Various 
fragments of CSIGP may be synthesized separately and then combined to produce the full length 

30 molecule. 

THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between CSIGP and cell signaling proteins. In addition, the expression of CSIGP is closely 
35 associated with cell proliferation and inflammatory disorders. Therefore, in cell proliferative and 
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inflammatory disorders where CSIGP is an inhibitor or suppressor of cell proliferation, it is 
desirable to increase the expression of CSIGP. In cell proliferative and inflammatory disorders 
where CSIGP is an activator or enhancer and is promoting cell proliferation, it is desirable to 
decrease the expression of CSIGP. ' 
5 Therefore, in one embodiment, CSIGP or a fragment or derivative thereof may be 

administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of CSIGP. Examples of such disorders include, but are not limited to, a cell proliferative 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, 
mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, 

r 

10 polycythemia vera, psoriasis, primary thrombocythemia; cancers including adenocarcinoma, 
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers 
of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, 
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, 
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; and an inflammatory 

15 disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult 

respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia^ asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis, cholecystitis, 
contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, 
emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema 

20 nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, 
Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 
myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, 

25 thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and 
helminthic infections, and trauma. 

In another embodiment, a vector capable of expressing CSIGP or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 

30 expression or activity of CSIGP including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially 
purified CSIGP in conjunction with a suitable pharmaceutical carrier may be administered to a 
subject to treat or prevent a disorder associated with decreased expression or activity of CSIGP 
including, but not limited to, those provided above. 

35 In still another embodiment an agonist which modulates the activity of CSIGP may be 

-26- 



t 



, i WO 99/58558 PCT/US99/10567 

administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of CSIGP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of CSIGP may be administered to a subject to treat 
or prevent a disorder associated with increased expression or activity of CSIGP. Examples of 
5 such disorders include, but are not limited to, those described above. In one aspect, an antibody 
which specifically binds CSIGP may be used, directly as an antagonist or indirectly as a targeting 
or delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express 
CSIGP. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
10 encoding CSIGP may be administered to a subject to treat or prevent a disorder associated with 
^creased-expression-o^activ4Q^a f CSIGP in clud ing , but n o t-limited to,-thos^described above. 
In other embodiments, any of the proteins, antagonists, antibodies, agonists, 
complementary sequences, or vectors of the invention may be administered in combination with 
_QAer„appropriate.therapeutic_agents._SeIection-of theappropriateagents-for use in combination 
15 therapy may be made by one of ordinary skill in the art, according to conventional pharmaceutical 
principles. The combination of th-erapeutic agents may act synergistically to effect the treatment 
or prevention of the various disorders described above. Using this approach, one may be able to 
achieve therapeutic efficacy with lower dosages of each agent, thus reducing the potential for 
adverse side effects. 

20 An antagonist of CSIGP may be produced using methods which are generally known in 

the art. In particular, purified CSIGP may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind CSIGP. Antibodies to CSIGP may 
also be generated using methods that are well known in the art. Such antibodies may include, but 
are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, 

25 and fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which 
inhibit dimer formation) are especially preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, 
humans, and others may be immunized by injection with CSIGP or with any fragment or 
oligopeptide thereof which has immunogenic properties. Depending on the host species, various 

30 adjuvants may be used to increase immunological response. Such adjuvants include, but are not 
limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such 
as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. 
Among adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corvnebacterium parvum 
are especially preferable. 

35 11 is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
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CSIGP have an amino acid sequence consisting of at least about 5 amino acids, and, more 
preferably, of at least about 1 0 amino acids. It is also preferable that these oligopeptides, peptides, 
or fragments are identical to a portion of the amino acid sequence of the natural protein and 
contain the entire amino acid sequence of a small, naturally occurring molecule. Short stretches of 
5 CSIGP amino acids may be fused with those of another protein, such as KLH, and antibodies to 
the chimeric molecule may be produced. 

Monoclonal antibodies to CSIGP may be prepared using any technique which provides for 
the production of antibody molecules by continuous ceil lines in culture. These include, but are 
not limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV- 

10 hybridoma technique. (See,, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. 
(1985) J. Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. 
80:2026-2030; and Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 

15 antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) 
Proc. Natl. Acad. Sci. 81 :685 1-6855; Neuberger, M.S! et al. (1984) Nature 3 12:604-608; and 
Takeda, S. et ai. (1985) Nature 3 14:452-454.) Alternatively, techniques described for the 
production of single chain antibodies may be adapted, using methods known in the art, to produce 

CSIGP-specific single chain antibodies. Antibodies with related specificity, but of distinct 

i 

20 idiotypic composition, may be generated by chain shuffling from random combinatorial 

immunoglobulin libraries. (See, e.g., Burton D.R. (1991) Proc. Natl. Acad. Sci. 88:10134 r 10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents 
as disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. 86: 

25 3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for CSIGP may also be 
generated. For example, such fragments include, but are not limited to, F(ab')2 fragments 
produced by pepsin digestion of the antibody molecule and Fab fragments generated by reducing 
the disulfide bridges of the F(ab*)2 fragments. Alternatively, Fab expression libraries may be 

30 constructed to allow rapid and easy identification of monoclonal Fab fragments with the desired 
specificity. (See, e.g., Huse, W.D. et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the 
desired specificity. Numerous protocols for competitive binding or immunoradiometric assays 
using either polyclonal or monoclonal antibodies with established specificities are well known in 

35 the art. Such immunoassays typically involve the measurement of complex formation between 
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CSIGP and its specific antibody. A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering CSIGP epitopes is preferred, but a 
competitive binding assay may also be employed (Pound, supra V 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
5 techniques may be used to assess the affinity of antibodies for ABBR. Affinity is expressed as an 
association constant, K,, which is defined as the molar concentration of ABBR-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equilibrium 
conditions. The K, determined for a preparation of polyclonal antibodies, which are 
heterogeneous in their affinities for multiple ABBR epitopes, represents the average affinity, or 
10 avidity, of the antibodies for ABBR. The K, determined for a preparation of monoclonal 
--antibodies^^ict^^-monospecific foi^a-particular ABBR-epitope, represents a true measure of 
affinity. High-affinity antibody preparations with K, ranging from about 10 9 to 10 12 L/mole are 
preferred for use in immunoassays in which the ABBR-antibody complex must withstand rigorous 
-manipulat ions . 1 ,ow-affinity a ntiho dy^reparations-with-K^ranging from-aboutlOUo lO 7 JVmole 
15 are preferred for use in immunopurification and similar procedures which ultimately require 

dissociation of ABBR, preferably in active form, from the antibody (Catty, D. (1988) Antibodies. 
Volume I: A Practical Approach. IRL Press, Washington DC; Liddell, J. E. and Cryer, A. (1991) 
A Practical Guide to Monoclonal Antibodies . John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
20 determine the quality and suitability of such preparations for certain downstream applications. For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/nil, 
preferably 5-10 mg specific antibody/ml, is preferred for use in procedures requiring precipitation 
of ABBR-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, 
and guidelines for antibody quality and usage in various applications, are generally available. 
25 (See, e.g., Catty, supra, and Coligan et ah supra .) 

In another embodiment of the invention, the polynucleotides encoding CSIGP, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the 
complement of the polynucleotide encoding CSIGP may be used in situations in which it would be 
desirable to block the transcription of the mRNA. In particular, cells may be transformed with 
30 sequences complementary to polynucleotides encoding CSIGP. Thus, complementary molecules 
or fragments may be used to modulate CSIGP activity, or to achieve regulation of gene function. 
Such technology is now well known in the art, and sense or antisense oligonucleotides or larger 
fragments can be designed from various locations along the coding or control regions of sequences 
encoding CSIGP. 

35 Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, 
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or from various bacterial plasm ids, may be used for delivery of nucleotide sequences to the 
targeted organ, tissue, or cell population. • Methods which are well known to those skilled in the art 
can be used to construct vectors to express nucleic acid sequences complementary to the 
polynucleotides encoding CSIGP. (See, e.g., Sambrook, supra : Ausubel, 199S. supra /* 
5 Genes encoding CSIGP can be turned off by transforming a cell or tissue with expression 

vectors which express high levels of a polynucleotide; or fragment thereof, encoding CSIGP. 
Such constructs may be used to introduce untranslatable sense or antisense sequences into a cell. 
Even in the absence of integration into the DNA, such vectors may continue to transcribe RNA 

molecules until they are disabled by endogenous nuclease^. Transient expression may last for a 

i 

10 month or more with a non-replicating vector, and may last even longer if appropriate replication 
elements are part of the vector system. 

As mentioned above, modifications of gene expression can be obtained by designing 
complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5\ or 
regulatory regions of the gene encoding CSIGP. Oligonucleotides derived from the transcription 

15 initiation site, e.g., between about positions -10 and +10 from the start site, are preferred. 

Similarly, inhibition can be achieved using triple helix base-pairing methodology. Triple helix 
pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently 
for the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic 
advances using triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. 

20 (1994) in Huber, B.E. and B.I. Caijr, Molecular and Immunologic Approaches . Future Publishing, 
Mt. Kisco NY, pp. 163-177.) A complementary sequence or antisense molecule may also be 
designed to block translation of mRNA by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage 
of RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the 

25 ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage. For 
example, engineered hammerhead motif ribozyme molecules may specifically and efficiently 
catalyze endonucleolytic cleavage of sequences encoding CSIGP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: 

30 GUA, GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 

ribonucleotides, corresponding to the region of the target gene containing the cleavage site, may 
be evaluated for secondary structural features which may render the oligonucleotide inoperable. 
The suitability of candidate targets may also be evaluated by testing accessibility to hybridization 
with complementary oligonucleotides using ribonuclease protection assays. 

35 Complementary ribonucleic acid molecules and ribozymes of the invention may be 
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prepared by any method known in the art for the synthesis of nucleic acid molecules. These 
include techniques for chemically synthesizing oligonucleotides such as solid phase 
phosphoramidite chemical synthesis. Alternatively, ( RNA molecules may be generated by in vitro 
and in vivo transcription of DNA sequences encoding CSIGP. Such DNA sequences may be 
5 incorporated into a wide variety of vectors with suitable RNA polymerase promoters such as T7 or 
SP6. Alternatively, these cDNA constructs that synthesize complementary RNA, constitutively or 
inducibly, can be introduced into cell lines, cells, or tissues. 1 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but; are not limited to, the addition of flanking sequences at the 5' and/or 3 f 
10 ends of the molecule, or the J use of phosphorothioate or 2' O-methyl rather than phosphodiesterase 
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the inclusion of non traditional bases such as 
inosine, queosine, and wybutosine, as well, as 

acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, thymine, and 
15 uridine which are not as easily recognized by endogenous endonucleases. 

Many methods for introducing vectors into cells or tissues are available and equally 
suitable for use in vivo, in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into 
stem cells taken from the patient and clonally propagated for autologous transplant back into that 
same patient. Delivery by transfection, by liposome injections, or by polycationic amino polymers 

20 may be achieved using methods which are well knowW in the art. (See, e.g., Goldman, C.K. et al. 
(1997) Nature Biotechnology 15:462-466.) 1 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as dogs, cats, cows, horses, rabbits, 
monkeys, and most preferably, humans. 

25 An additional embodiment of the invention relates to the administration of a 

pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable carrier, 
for any of the therapeutic effects discussed above. Such pharmaceutical compositions may consist 
of CSIGP, antibodies to CSIGP, and mimetics, agonists, antagonists, or inhibitors of CSIGP. The 
compositions may be administered alone or in combination with at least one other agent, such as a 

30 stabilizing compound, which may be administered in any sterile, biocompatible pharmaceutical 
carrier including, but not limited to, saline, buffered saline, dextrose, and water. The compositions 
may be administered to a patient alone, or in combination with other agents, drugs, or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by any 
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 

35 intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
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enteral, topical, sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions may contain 
suitable pharmaceutical ly-acceptable carriers comprising excipients and auxiliaries which 
facilitate processing of the active compounds into preparations which can be used 
5 pharmaceutical^. Further details on techniques for formulation and administration may be found 
in the latest edition of Remington's Pharmaceutical Sciences (Maack Publishing, Easton PA). 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutical iy acceptable carriers well known in the art in dosages suitable for oral 

administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, 

t 

10 pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by 
the patient. 

Pharmaceutical preparations for oral use can be obtained through combining active 
compounds with solid excipient and processing the resultant mixture of granules (optionally, after 
grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired. Suitable 

15 excipients include carbohydrate or protein fillers, such as sugars, including lactose, sucrose, 
mannitol, and sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as 
methyl cellulose, hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, 
including arabic and tragacanth; and proteins, such as gelatin and collagen. If desired, 
disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, 

20 agar, and alginic acid or a salt thereof, such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as concentrated 
sugar solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or 
solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for 

25 product identification or to characterize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. 
Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or 
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft 

30 capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty 
oils, liquid, or liquid polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be formulated in 
aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution, 
Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain 

35 substances which increase the viscosity of the suspension, such as sodium carboxymethyl 
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cellulose, sorbitol, or dextran. Additionally, suspensions of the active compounds may be 

i 

prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include 
fatty oils, such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate, triglycerides, or 
liposomes. Non-lipid polycationic amino polymers may also be used for delivery. Optionally, the 
5 suspension may also contain suitable stabilizers or agents to increase the solubility of the 
compounds and allow for the preparation of highly concentrated solutions. 

For topical pr nasal administration, penetrants appropriate to the particular barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art. 

Hie pharmaceutical compositions of the present invention may be manufactured in a 
10 manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be formed with many 
acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and 
succinic acid. Salts tend to be more soluble in aqueous or other protonic solvents than are the 
15 corresponding free base forms. In other cases, the preferred preparation may be a lyophilized 
powder which may contain any or all of the following: 1 mM to 50 mM histidine, 0.1% to 2% 
sucrose, and 2% to 7% mannitol, at a pH range of 4.5 to 5.5, that is combined with buffer prior to 
use. 

After pharmaceutical compositions have been prepared, they can be placed in an 
20 appropriate container and labeled 1 for treatment of an indicated condition. For administration of 
CSIGP, such labeling would include amount, frequency, and method of administration. 

Pharmaceutical compositions suitable for use in the invention include compositions 
wherein the active ingredients are contained in an effective amount to achieve the intended 
purpose. The determination of an effective dose is well within the capability of those skilled in the 
25 art 

For any compound, the therapeutically effective dose can be estimated initially either in 
cell culture assays, e.g., of neoplastic cells or in animal models such as mice, rats, rabbits, dogs, or 
pigs. An animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determine useful doses and routes 

30 for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
CSIGP or fragments thereof, antibodies of CSIGP, and agonists, antagonists or inhibitors of 
CSIGP, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 
determined by standard pharmaceutical procedures in cell cultures or with experimental animals, 

35 such as by calculating the ED^ (the dose therapeutically effective in 50% of the population) or 
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LD5Q (the dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic 
effects is the therapeutic index, and it can be expressed as the LD^ ED^ ratio. Pharmaceutical' 
compositions which exhibit large therapeutic indices are preferred. The data obtained from cell 
culture assays and animal studies are used to formulate a range of dosage for human use. The 
5 dosage contained in such compositions is preferably within a range of circulating concentrations 
that includes the ED^ with little or no toxicity. The dosage varies within this range depending 
upon the dosage form employed, the sensitivity of the patient,,and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of 

10 the active moiety or to maintain the desired effect Factors which may be taken into account 
include the severity of the disease state, the general health of the subject, the age, weight, and 
gender of the subject, time and frequency of administration, drug combination(s), reaction 
sensitivities, and response to therapy. Long-acting pharmaceutical compositions may be 
administered every 3 to 4 days, every week, or biweekly depending on the half-life and clearance 

15 rate of the particular formulation. 

Normal dosage amounts may vary from about' 0.1 jug to 100,000 /ig, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 

20 inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular 
cells, conditions, locations, etc. , 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind CSIGP may be used for the 
diagnosis of cell proliferative and inflammatory disorders characterized by expression of CSIGP, 
25 or in assays to monitor patients being treated with CSIGP or agonists, antagonists, or inhibitors of 
CSIGP. Antibodies useful for diagnostic purposes may be prepared in the same manner as 
described above for therapeutics. Diagnostic assays for CSIGP include methods which utilize the 
antibody and a label to detect CSIGP in human body fluids or in extracts of cells or tissues. The 
antibodies may be used with or without modification, and may be labeled by covalent or non- 
30 covalent attachment of a reporter molecule. A wide variety of reporter molecules, several of which 
are described above, are known in the art and may be used. 

A variety of protocols for measuring CSIGP, including ELISAs, RIAs, and FACS, are 
known in the art and provide a basis for diagnosing altered or abnormal levels of CSIGP 
expression. Normal or standard values for CSIGP expression are established by combining body 
35 fluids or cell extracts taken from normal mammalian subjects, preferably human, with antibody to 
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CSIGP under conditions suitable for complex formation. The amount of standard complex 
formation may be quantitated by various methods, preferably by photometric means. Quantities of 
CSIGP expressed in subject, control, and disease samples from biopsied tissues are compared with 
the standard values. Deviation between standard and subject values establishes the parameters for 
5 diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding CSIGP may be 
used for diagnostic purposes. The polynucleotides which may be used include oligonucleotide 
sequences, complementary RNA and DNA molecules,- and PNAs. The polynucleotides may be 
used to detect and quantitate gene expression in biopsied tissues in which expression of CSIGP 
10 may be correlated with disease. The diagnostic assay may be used to determine absence, 
— presenee^d^^ss^ression^^ 
therapeutic intervention. 

In one aspect, hybridization with P,CR probes which are capable of detecting 
-4>olynucleotide sequences^including^genomic^sequences^en coding CS IGRor-closely related 
15 molecules may be used to identify nucleic acid sequences which encode CSIGP. The specificity 
of the probe, whether it is made from a highly specific region, e.g., the 5* regulatory region, or 
from a less specific region, e.g., a conserved motif, and the stringency of the hybridization or 
amplification (maximal, high, intermediate, or low), will determine whether the probe identifies 
only naturally occurring sequences encoding CSIGI>, allelic variants, or related sequences. 
20 Probes may also be used for the detection of related sequences, and should preferably 

have at least 50% sequence identity to any of the CSIGP encoding sequences. The hybridization 
probes of the subject invention may be DNA or RNA and may be derived from the sequence of 
SEQ ID NO: 14-26 or from genomic sequences including promoters, enhancers, and introns of the 
CSIGP gene. 

25 Means for producing specific hybridization probes for DNAs encoding CSIGP include the 

cloning of polynucleotide sequences encoding CSIGP or CSIGP derivatives into vectors for the 
production ofmRNA probes. Such vectors are known in the art, are commercially available, and 
may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 

30 variety of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding CSIGP may be used for the diagnosis of cell 
proliferative and inflammatory disorders associated with expression of CSIGP. Examples of such 
disorders include, but are not limited to, a disorder of cell proliferation such as actinic keratosis, 

35 arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
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(MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, 
primary thrombocythemia; cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, 
bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, 
5 liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, 
testis, thymus, thyroid, and uterus; and an inflammatory disorder such as acquired 
immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, 
allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune 

hemolytic anemia, autoimmune thyroiditis, bronchitis, cholecystitis, contact dermatitis, Crohn's 

i 

10 disease, atopic dermatitis, dermatomybsitis, diabetes mellitus, emphysema, episodic lymphopenia 
with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Re iter's 

15 syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, 
Werner syndrom^, complications of cancer, hemodialysis, and extracorporeal circulation, viral, 
bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma. The polynucleotide 
sequences encoding CSIGP may be used in Southern or northern analysis, dot blot, or other 

20 membrane-based technologies; ii^ PCR technologies; in dipstick, pin, and ELISA assays; and in 
microarrays utilizing fluids or tissues from patients to detect altered CSIGP expression. Such 
qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding CSIGP may be useful in assays 
that detect the presence of associated disorders, particularly those mentioned above. The 

25 nucleotide sequences encoding CSIGP may be labeled by standard methods and added to a fluid or 
tissue sample from a patient under conditions suitable for the formation of hybridization 
complexes. After a suitable incubation period, the sample is washed and the signal is quantitated 
and compared with a standard value. If the amount of signal in the patient sample is significantly 
altered in comparison to a control sample then the presence of altered levels of nucleotide 

30 sequences encoding CSIGP in the sample indicates the presence of the associated disorder. Such 
assays may also be used to evaluate the efficacy of a particular therapeutic treatment regimen in 
animal studies, in clinical trials, or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of 
CSIGP, a normal or standard profile for expression is established. This may be accomplished by 

35 combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
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sequence, or a fragment thereof, encoding CSIGP, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from 
normal subjects with values from an experiment in which a known amount of a substantially 
purified polynucleotide is used. Standard values obtained in this manner may be compared with 
5 values obtained from samples from patients who are symptomatic for a disorder. Deviation from 
standard values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in 
the patient begins to approximate that which is observed in the normal subject The results 

10 obtained from successive assays may be used to show the efficacy of treatment over a period 

ra nging from several-days to^nonths: 

With respect to cancer, the presence of a relatively high amount of transcript in biopsied 
tissue from an individual may indicate a predisposition for the development of the disease, or may 
—provide a means for ^detecting4he^isease-prior^o4he^pp A 

15 more definitive diagnosis of this type may allow health professionals to employ preventative 
measures or aggressive treatment earlier thereby preventing the development or further 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding 
CSIGP may involve the use of PCR. These oligomers may be chemically synthesized, generated 

20 enzymatically, or produced in vitfo . Oligomers will preferably contain a fragment of a 
polynucleotide encoding CSIGP, or a fragment of a polynucleotide complementary to the 
polynucleotide encoding CSIGP, and will be employed under optimized conditions for 
identification of a specific gene or condition. Oligomers may also be employed under less 
stringent conditions for detection or quantitation of closely related DNA or RNA sequences. 

25 Methods which may also be used to quantitate the expression of CSIGP include 

radiolabeling or biotinylating nucleotides, coamplification of a control nucleic acid, and 
interpolating results from standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. 
Methods 159:235-244; Duplaa, C. et al. (1993) Anal. Biochenu 229-236.) The speed of 
quantitation of multiple samples may be accelerated by running the assay in an ELISA format 

30 where the oligomer of interest is presented in various dilutions and a spectrophotometry or 
colorimetric response gives rapid quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as targets in a microarray. The 
microarray can be used to monitor the expression level of large numbers of genes simultaneously 

35 and to identify genetic variants, mutations, and polymorphisms. This information may be used to 
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determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, and 
to develop and monitor the activities of therapeutic agents. 

Microarrays may be prepared, used, and analyzed using methods known in the art (See, 
e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. 
Acad. Sci. 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/25 1 1 16; Shalon, 
D. et al. (1995) PCT application WO95/35505; Heller, RA. et al. (1997) Proc ? Natl. Acad. Sci. 
94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) ( 

In another embodiment of the invention, nucleic acid sequences encoding CSIGP may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic 
sequence. The sequences m£y be mapped to a particular chromosome, to a specific region of a 
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes 
(HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial 
PI constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J J. et al. (1997) 
Nat Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, BJ. (1991)Trends 
Genet. 7:149-154.) ! 

Fluorescent in situ hybridization (FISH) may be correlated with other physical 
chromosome mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. (1995) in 
Meyers, supra, pp. 965-968.) Examples of genetic map data can be found in various scientific 
journals or at the Online Mendelian Inheritance in Man (OMIM) site. Correlation between the 
location of the gene encoding CSIGP on a physical chromosomal map and a specific disorder, or a 
predisposition to a specific disorder, may help define the region of DNA associated with that 
disorder. The nucleotide sequences of the invention may be used to detect differences in gene 
sequences among normal, carrier, and affected individuals. 

IlLsitu hybridization of chromosomal preparations and physical mapping techniques, such 
as linkage analysis using established chromosomal markers, may be used for extending genetic 
maps. Often the placement of a gene on the chromosome of another mammalian species, such as 
mouse, may reveal associated markers even if the number or arm of a particular human 
chromosome is not known. New sequences can be assigned to chromosomal arms by physical 
mapping. This provides valuable information to investigators searching for disease genes using 
positional cloning or other gene discovery techniques. Once the disease or syndrome has .been 
crudely localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 
1 lq22-23, any sequences mapping to that area may represent associated or regulatory genes for 
further investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide 
sequence of the subject invention may also be used to detect differences in the chromosomal 
location due to translocation, inversion, etc., among normal, carrier, or affected individuals. 

-38- 



» WO 99/58558 



PCT/US99/10567 



In another embodiment of the invention, CSIGP, its catalytic or immunogenic fragments, 
or oligopeptides thereof can be used for screening libraries of compounds in any of a variety of 
drug screening techniques. The fragment employed in such screening may be free in solution, 
affixed to a solid support, borne on a cell surface, or located intracetlularly. The formation of 
5 binding complexes between CSIGP and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of 
compounds having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. 
(1984) PCT application WO84/03564.) In this method^ large numbers of different small test 
compounds are synthesized on a solid substrate. The test compounds are reacted with CSIGP, or 
10 fragments thereof, and washed. Bound CSIGP is then detected by methods well known in the art. 
Purified CSIGP can also be coated directly onto plates for use in the aforementioned drug 
screening techniques. Alternatively, non-neutralizing antibodies can be used to capture the 
peptide and immobilize it on a solid support i 

In another embodiment, one may use competitive drug screening assays in which 
15 neutralizing antibodies capable of binding CSIGP specifically compete with a test compound for 
binding CSIGP. In this manner, antibodies can be used to detect the presence of any peptide 
which shares one or more antigenic determinants with CSIGP. 

In additional embodiments, the nucleotide sequences which encode CSIGP may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely 
20 on properties of nucleotide sequences that are currently known, including,, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the 
preceding description, utilize the present invention to its fullest extent. The following preferred 
specific embodiments are, therefore, to be construed as merely illustrative, and not limitative of 
25 the remainder of the disclosure in any was whatsoever. 

The entire disclosure of all applications, patents, and publications, cited above and below, 
and of US provisional applications 60/085,343 (filed May 13, 1998), and 60/098,010 (filed 
August 26, 1998) are hereby incorporated by reference. 

EXAMPLES 
30 L Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
tissues were homogenized and lysed in guanidinium isothiocyanate, while others were 
homogenized and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life 
Technologies), a monophasic solution of phenol and guanidine isothiocyanate. The resulting 
35 lysates were centrifuged over CsCl cushions or extracted with chloroform. RNA was precipitated 
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from the lysates with either isopropariol or sodium acetate and ethanol, or by other routine 
methods. ' , 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was 
5 isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles 
(QIAGEN, Valencia CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, 
RNA was isolated directly from tissue lysates using other RNA, isolation kits, e.g., the 1 
POLY(A)PURE mRNA purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding 
10 cDNA libraries. Otherwise, c^NA was synthesized and cDNA libraries were constructed with the 
UNI ZAP vector sy stem (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), 
using the recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 

1997, supra, units 5.1-6.6). Reverse transcription was initiated using oligo d(T) or random 

i 

primers. Synthetic oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA 

15 was digested with the appropriate restriction enzyme or enzymes. For most libraries, the cDNA 
was size-selected (300-1000 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or 
SEPHAROSE CL4B column chromatography (Amersham Pharmacia Biotech) or preparative 
agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of the 
polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid (Stratagene), pSPORTl plasmid 

20 (Life Technologies), or pINCY (Incyte Pharmaceutical?; Palo Alto C A). Recombinant plasmids 
were transformed into competent RcoH cells including XLl-Blue, XLl-BlueMRF, or SOLR from 
Stratagene or DHSct, DH10B, or ElectroMAX DH10B from Life Technologies, 
n. Isolation of cDNA Clones 

Plasmids were recovered from host cells by in vivo excision, using the UNIZAP vector 

25 system (Stratagene) or cell lysis. Plasmids were purified using at least one of the following: a 
Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC Miniprep 
purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 8 
Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the REAL Prep 96 plasmid kit 
from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of distilled water 

30 and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PGR in 
a high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and 
thermal cycling steps were carried out in a single reaction mixture. Samples were processed and 
stored in 384-well plates, and the concentration of amplified plasmid DNA was quantified 

35 fluorometrically using PICOGREEN dye (Molecular Probes, Eugene OR) and a Fluoroskan II 
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fluorescence scanner (Labsystems Oy, Helsinki, Finland), 
m. Sequencing and Analysis 

The cDNAs were prepared for sequencing, using either an ABI CATALYST 800 (Perkin- 
Elmer) or a HYDRA microdispenser (Robbins) or MICROLAB 2200 (Hamilton) sequencing 
5 preparation system in combination with PTC-200 thermal cyclers (MJ Research). The cDNAs 
were sequenced using the ABI PRISM 373 or 377 sequencing systems of the MEGABACE 1000 
DNA sequencing system (Molecular Dynamics) and ABI protocols, base calling software, and kits 
(Perkin-EImer). Alternatively, solutions and dyes from Amersham Pharmacia Biotech were used. 
Reading frames were determined using standard methods (Ausubel, 1997. supra ). Some of the 

10 cDNA sequences were selected for extension using the techniques disclosed in Example V. 

The polynucleotide sequences derived from cDNA, extension, and shotgun sequencing 
were assembled and analyzed using a combination of software programs which utilize algorithms 
well known to those skilled in the art. Table 5 summarizes the software programs, descriptions, 
references, and threshold parameters used. The first column of Table 5 shows the tools, programs, 

15 and algorithms used, the second column provides a brief description thereof, the third column 
presents the references which are incorporated by reference herein, and the fourth column 
presents, where applicable, the scores, probability values, and other parameters used to evaluate 
the strength of a match between two sequences (the higher the probability the greater the 
homology). Sequences were analyzed using MACI^NASIS PRO software (Hitachi Software 

20 Engineering, S. San Francisco CA) and LASERGENE software (DNASTAR). 

cDNAs were also compared to sequences in GenBank using a search algorithm developed 
by Applied Biosystems and incorporated into the INHERIT™ 670 sequence analysis system. In 
this algorithm, Pattern Specification Language (TRW Inc, Los Angeles, CA) was used to 
determine regions of homology. The three parameters that determine how the sequence 

25 comparisons run were window size, window offset, and error tolerance. Using a combination of 
these three parameters, the DNA database was searched for sequences containing regions of 
homology to the query sequence, and the appropriate sequences were scored with an initial value. 
Subsequently, these homologous regions were examined using dot matrix homology plots to 
distinguish regions of homology from chance matches. Smith- Waterman alignments were used to 

30 display the results of the homology search. 

Peptide and protein sequence homologies were ascertained using the INHERIT- 670 
sequence analysis system using the methods similar to those used in DNA sequence homologies. 
Pattern Specification Language and parameter windows were used to search protein databases for 
sequences containing regions of homology which were scored with an initial value. Dot-matrix 

35 homology plots were examined to distinguish regions of significant homology from chance 
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matches. 

The polynucleotide sequences were validated by removing vector, linker, and polyA 
sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, 
dynamic programing, and dinucleotide nearest neighbor analysis. The sequences were then 

5 queried against a selection of public databases such as GenBank primate, rodent, mammalian, 

i 

vertebrate, and eukaryote databases, and BLOCKS to acquire annotation, using programs based 
on BLAST, FASTA, and BLIMPS. The sequences were assembled into full length polynucleotide 
sequences using programs based on Phred, Phrap, and Consed, and were screened for open 

reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 

i 

1 0 polynucleotide sequences were translated to derive the corresponding full length amino acid 
sequences, and these full length sequences were subsequently analyzed by querying against 
databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, 
PFAM, and Prosite. 

The programs described above for the assembly and analysis of full length polynucleotide 
15 and amino acid sequences were also used to identify polynucleotide sequence fragments from 
SEQ ID NO: 14-26. Fragments fi;om about 20 to about 4000 nucleotides which are useful in 
hybridization and amplification technologies were described in The Invention section above. 
IV. Northern Analysis 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
20 gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which 
RNAs from a particular cell type or tissue have been bound. (See, e.g., Sambrook. supra , ch. 7; 
Ausubel, 1995, supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or 
related molecules in nucleotide databases such as GenBank or LIFESEQ database (Incyte 
25 Pharmaceuticals). This analysis is much faster than multiple membrane-based hybridizations. In 
addition, the sensitivity of the computer search can be modified to determine whether any 
particular match is categorized as exact or similar. The basis of the search is the product score, 
which is defined as: % sequence identity x % maximum BLAST score 

100 

30 The product score takes into account both the degree of similarity between two sequences 

and the length of the sequence match. For example, with a product score of 40, the match will be 
exact within a 1% to 2% error, and, with a product score of 70, the match will be exact. Similar 
molecules are usually identified by selecting those which show product scores between 15 and 40, 
although lower scores may identify related molecules. 

35 The results of northern analyses are reported a percentage distribution of libraries in which 
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the transcript encoding CSIGP occurred. Analysis involved the categorization of cDNA libraries 
by organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 
reproductive, and urologic. The disease or condition categories included cancer, 
5 inflammation/trauma, cell proliferation, neurological, and' pooled. For each category, the number 
of libraries expressing the sequence of interest was counted and divided by the total number of 
libraries across all categories. Percentage values of tissue-specific and disease expression are 
reported in Table 3. 

i 

V. Extension of CSIGP Encoding Polynucleotides 
10 The ftill length nucleic acid sequence of SEQ ID NO: 14-26 was produced by extension of 

aira PPropriaterf^ from 

this fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the 
other primer, to initiate 3' extension of the known fragment. The initial primers were designed 
using OHGe^06^oftware^ationaH3iosciene« be about 

15 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the 
target sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides which 
would result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

20 High fidelity amplification was obtained by PCR using methods well known in the art. 

PCR was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The 
reaction mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, 
(NR^SO^ and 0-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), 
ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Stratagene), with the 

25 following parameters for primer pair PCI A and PCI B: Step 1 : 94°C, 3 min; Step 2: 94 °C, 1 5 sec- 
Step 3: 60°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 
68°C, 5 min; Step 7: storage at 4°C. In the alternative, the parameters for primer pair T7 and SK+ 
were as follows: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. 

30 The concentration of DNA in each well was determined by dispensing 100 }xl 

PICOGREEN quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) 
dissolved in IX TE and 0.5 \x\ of undiluted PCR product into each well of an opaque fluorimeter 
plate (Corning Costar, Acton MA), allowing the DNA to bind to the reagent. The plate was 
scanned in a Fluoroskan II (Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the 

35 sample and to quantify the concentration of DNA. A 5 fil to 1 0 ^1 aliquot of the reaction mixture 
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was analyzed by electrophoresis on a 1 % agarose mini-gel to determine which reactions were 
successful in extending the sequence. 1 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
5 sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended 
clones were religated using T4 ligase (New, England Biolabs, Beverly MA) into pUC 1 8 vector 

(Amersham Pharmacia Biotech), treated with Pfij DNA polymerase (Stratagene) to fill-in 

i 

10 restriction site overhangs, and transfected into competent E. coli cells. Transformed cells were 
selected on antibiotic-containing media, individual colonies were picked and cultured overnight at 
37°C in 384-well plates in LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 

15 parameters: Step 1 : 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; 
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°fc, 5 min; Step 7: storage at 4°C. DNA was 
quantified by PlboGREEN reagent (Molecular Probes) as described above. Samples with low 
DNA recoveries were reamplified using the same conditions as described above. Samples were 

diluted with 20% dimethysulphoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer 

t 

20 sequencing primers and the DYENAMIC DIRECT kit, (Amersham Pharmacia Biotech) or the ABI 
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Perkin-Elmer). i 

In like manner, the nucleotide sequence of SEQ ID NO: 14-26 is used to obtain 5* 
regulatory sequences using the procedure above, oligonucleotides designed for such extension, 
and an appropriate genomic library. 

25 VI* Choice, Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO: 14-26 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 
base pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 

30 software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 A*Ci of 
[y- 32 P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPontNEN, Boston MA). The labeled oligonucleotides are substantially purified using a 
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 
An aliquot containing 10 7 counts per minute of the labeled probe is used in a typical membrane- 

35 based hybridization analysis of human genomic DNA digested with one of the following 
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endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xbal, or Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature 
5 under increasingly stringent conditions up to 0.1 x saline sodium citrate and 0.5% sodium dodecyl 
sulfate. After XOMAT-AR film (Eastman Kodak, Rochester NY) is exposed to the blots to film 
for several hours, hybridization patterns are compared visually. 

VII. Microarrays 

A chemical coupling procedure and an ink jet device can be used to synthesize array 
10 elements on the surface of a substrate. (See, e.gj, Baldeschweiler, supra .) An array analogous to a 
dot or slot-blot m^y also be u sed to ar ran ge a mHink elements to the s urface of a ~substrate~using 
thermal, UV, chemical, or mechanical bonding procedures. A typical array may be produced by 
hand or using available methods and machines and contain any appropriate number of elements. 
AfterhybridizatienTflonhyferidized-probes are removed-and a-seartnfer used to determine the levels 
15 and patterns of fluorescence. The degree of complementarity and the relative abundance of each 
probe which hybridizes to an element on the microarray may be assessed through analysis of the 
scanned images. 

Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may 
comprise the elements of the microarray. Fragments suitable for hybridization can be selected 

20 using software well known in the art such as LASERCjENE software (DNASTAR). Full-length 
cDNAs, ESTs, or fragments thereof corresponding to one of the nucleotide sequences of the 
present invention, or selected at random from a cDNA library relevant to the present invention, are 
arranged on an appropriate substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g., 
UV cross-linking followed by thermal and chemical treatments and subsequent drying. (See, e.g., 

25 Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645.) 
Fluorescent probes are prepared and used for hybridization to the elements on the substrate. The 
substrate is analyzed by procedures described above. 

VIII. Complementary Polynucleotides 

Sequences complementary to the CSIGP-encoding sequences, or any parts thereof, are 
30 used to detect, decrease, or inhibit expression of naturally occurring CSIGP. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides 
are designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of 
CSIGP. To inhibit transcription, a complementary oligonucleotide is designed from the most 
35 unique 5* sequence and used to prevent promoter binding to the coding sequence. To inhibit 
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translation, a complementary oligonucleotide is designed to prevent ribosomal binding to the 
CSIGP-encoding transcript. t 
DC Expression of CSIGP 

Expression and purification of CSIGP is achieved using bacterial or virus-based 
5 expression systems. For expression of CSIGP in bacteria, cDNA is subcloned into an appropriate 
vector containing an antibiotic resistance gene and an inducible promoter that directs high levels 
of cDNA transcription. Examples of such promoters include, but are not limited to, the trp-lac 
(tac) hybrid promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac 
operator regulatory element Recombinant vectors are transformed into suitable bacterial hosts, 

10 e.g., BL21(DE3). Antibiotic resistant bacteria express CSIGP upon induction with isopropyl beta- 
D-thiogalactopyranoside (IPTG). Expression of CSIGP in eukaryotic cells is achieved by 
infecting insect or mammalian cell lines with recombinant Autographica californica nuclear 
polyhedrosis virus (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin 
gene of baculovirus is replaced with cDNA encoding CSIGP by either homologous recombination 

15 or bacterial-mediated transposition involving transfer plasmid intermediates. Viral infectivity is 
maintained and the strong polyhedrin promoter drives high levels of cDNA transcriptipn. 
Recombinant baculovirus is used to infect Soodoptera frueioerda (Sf9) insect cells in most cases, 
or human hepatocytes, in some cases. Infection of the latter requires additional genetic 
modifications to baculovirus. (See Engelhard, E. K. et al. (1994) Proc. Natl. Acad. Sci. USA 

20 91 :3224-3227; Sandig, V. et al. (1996) Hum. Gene Then 7: 1937-1945.) 

In most expression systems, CSIGP is synthesized as a fusion protein with, e.g., 
glutathione S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, 
single-step, affinity-based purification of recombinant fusion protein from crude cell lysates. 
GST, a 26-kilodalton enzyme from Schistosoma iaponicum . enables the purification of fusion 

25 proteins on immobilized glutathione under conditions that maintain protein activity and 

antigenicity (Amersham Pharmacia Biotech). Following purification, the GST moiety can be 
proteolytically cleaved from CSIGP at specifically engineered sites. FLAG, an 8-amino acid 
peptide, enables immunoaffinity purification using commercially available monoclonal and 
polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a stretch of six consecutive histidine 

30 residues, enables purification on metal-chelate resins (QIAGEN). Methods for protein expression 
and purification are discussed in Ausubel (1995. supra , ch 10 and 16). Purified CSIGP obtained 
by these methods can be used directly in the following activity assay. 
X. Demonstration of CSIGP Activity 

CSIGP activity can be assayed jn vitro by monitoring the mobilization of Ca** as part of 

35 the signal transduction pathway. (See, e.g., Grynkievwicz, G. et al. (1985) J. Biol. Chem. 
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260:3440; McColl, S. et al. (1993) J. Immunol. 150:4550-4555; and Aussel, C. et al. (1988) supra) 
The assay requires preloading neutrophils or T cells with a fluorescent dye such as FURA-2 or 
BCECF (Universal Imaging Corp, Westchester PA) whose emission characteristics have been 
altered by Ca~ binding. When the cells are exposed to one or more activating stimuli artificially 
5 (ie, anti-CD3 antibody ligation of the T cell receptor) or physiologically (ie, by allogeneic 

stimulation), Ca~ flux takes place. This flux can ( be observed and quantified by assaying the cells 
in a fluorometer or fluorescent activated cell sorter. Measurements of Ca** flux are compared 
between cells in their normal state and those' preloaded with CSIGP. 

Protein kinase activity in CSIGP is determined by measuring the phosphorylation of a 
10 protein substrate using gamma- labeled 32 P^ATP and quantitation of the incorporated radioactivity 
using a radioisotope counter. CSIGP is incubated with the protein substrate, 32 P-ATP, and an 
appropriate kinase buffer. The 32 P incorporated into the product is separated from free 32 P-ATP by 
electrophoresis and the incorporated 32 P is counted. The amount of 32 P recovered is proportional 
to the activity of CSIGP in the assay. A determination of the specific amino acid residue t 

15 phosphorylated is made by phosphoamino acid analysis of the hydrolyzed protein. 

Protein phosphatase (PP) activity in CSIGP is determined by measuring the hydrolysis of 
P-nitrophenyl phosphate (PNPP). CSIGP is incubated together with PNPP in HEPES buffer pH 
7.5, in the presence of 0.1% b-mercaptoethanol at 37°C for 60 min. The reaction is stopped by the 
addition of 6 ml of 10 N NaOH and the increase in light absorbance at 410 nm resulting from the 

20 hydrolysis of PNPP is measured using a spectrophotometer. The increase in light absorbance is 
proportional to the activity of CSIGP in the assay. 1 
XI. Production of CSIGP Specific Antibodies 

CSIGP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is 

25 used to immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the CSIGP amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are 

30 well described in the art. (See, e.g., Ausubel, 1995, supra , ch. 11.) 

Typically, oligopeptides 15 residues in length are synthesized using an ABI 431 A 
Peptide Synthesizer (Perkin-Elmer) using fmoc-chemistiy and coupled to KLH (Sigma-Aldrich, 
St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to 
increase immunogenicity. (See, e.g., Ausubel, 1995,suEra.) Rabbits are immunized with the 

35 oligopeptide-KLH complex in complete Freund's adjuvant. Resulting antisera are tested for 
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antipeptide activity by, for example, binding the peptide to plastic, blocking with 1% BSA, 
reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 

XII. Purification of Naturally Occurring CSIGP Using Specific Antibodies 
Naturally occurring or recombinant CSIGP is substantially purified by immunoaffinity 

5 chromatography using antibodies specific for CSIGP. An immunoaffinity column is constructed 
by covalently coupling anti-CSIGP antibody to an activated chromatographic resin, such as 

CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 

i 

blocked and washed according to the manufacturer's instructions. 

Media containing CSIGP are passed over the immunoaffinity column, and the column is 
10 washed under conditions that allow the preferential absorbance of CSIGP (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/CSIGP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, 
such as urea or thiocyanate ion), and CSIGP is collected. 

i 

XIII. Identification of Molecules Which Interact with CSIGP 

15 CSIGP, or biologically active fragments thereof, are labeled with 123 I Boiton-Hunter 

reagent. (See, e.g., Bolton et al. (1973) Biochem. J. 133:529.) Candidate molecules previously 
arrayed in the wells of a multi-well plate are incubated with the labeled CSIGP, washed, and any 
wells with labeled CSIGP complex are assayed. Data obtained using different concentrations of 
CSIGP are used to calculate values for the number, affinity, and association of CSIGP with the 

20 candidate molecules. ( 

Various modifications and variations of the described methods and systems of the 
invention will be apparent to those skilled in the art without departing from the scope and spirit of 
the invention. Although the invention has been described in connection with specific preferred 
embodiments, it should be understood that the invention as claimed should not be unduly limited 

25 to such specific embodiments. Indeed, various modifications of the described modes for carrying 
out the invention which are obvious to those skilled in molecular biology or related fields are 
intended to be within the scope of the following claims. 
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What is claimed is: 

1 . A substantially purified polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-13, or a fragment thereof. 
5 2. A substantially purified variant having at least 90% amino acid identity to the 

amino acid sequence of claim 1. | 

3. 'An isolated and purified polynucleotide encoding the polypeptide of claim 1. 

4. An isolated and purified polynucleotide variant having at least 70% 
polynucleotide sequence identity to the polynucleotide of claim 3. 

10 5. An isolated And purified polynucleotide which hybridizes under stringent 

conditions to the polynucleotide of claim 3. 

6. An isolated and purified polynucleotide having a sequence which is 
complementary to the polynucleotide of claim 3. 

7. A method for detecting a polynucleotide, the method comprising the steps pf: 
15 (a) hybridizing the polynucleotide of claim 6 to at least one nucleic acid in 

the sample, thereby forming a hybridization complex; and 

(b) detecting the hybridization complex, wherein the presence of the 
hybridization complex correlates with the presence of the polynucleotide in the sample. 

8. The method of claim 7 further comprising amplifying the polynucleotide prior to 
20 hybridization. 1 

9. An isolated and purified polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of SEQ ID NO: 14-26, or a fragment thereof. 

10. An isolated and purified polynucleotide variant having at least 70% 
polynucleotide sequence identity to the polynucleotide of claim 9. 

25 11. An isolated and purified polynucleotide having a sequence which is 

complementary to the polynucleotide of claim 9. 

12. An expression vector comprising at least a fragment of the polynucleotide of 

claim 3. 

1 3. A host cell comprising the expression vector of claim 12. 

30 14. A method for producing a polypeptide, the method comprising the steps of: 

a) culturing the host cell of claim 13 under conditions suitable for the 
expression of the polypeptide; and 

b) recovering the polypeptide from the host cell culture. 

15. A pharmaceutical composition comprising the polypeptide of claim 1 in 
35 conjunction with a suitable pharmaceutical carrier. 
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16. A purified antibody which specifically binds to the polypeptide of claim 1 . 

1 7. A purified agonist of the polypeptide of claim 1 . 

18. A purified antagonist of the polypeptide of claim 1 . 

19. A method for treating or preventing a disorder associated with decreased 

5 expression of CSIGP, the method comprising administering to a subject in need of such treatment 
an effective amount of the pharmaceutical composition of claim 15. 

20. A method for treating or preventing a disorder associated with increased 
expression of CSIGP, the method comprising administering to a subject in need of such treatment 

an effective amount of the antagonist of claim 18. 1 

i 

10 
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<400> 3 f 

Met Gin Ser Thr Ser Asn His Leu Trp Leu Leu Ser Asp lie Leu 
1 5 10 15 

Gly Gin Gly Ala Thr Ala Asn Val Phe Arg Gly Arg His Lys Lys 
20 25 ( 30 

Thr Gly Asp Leu Phe Ala lie Lys Val Phe 'Asn Asn lie Ser Phe 
35 40 45 

Leu Arg Pro Val Asp Val Gin Met Arg Glu Phe Glu Val Leu Lys 
50 55 60 

Lys Leu Asn His Lys Asn lie Val Lys Leu Phe Ala lie Glu Glu 
65 70 75 

Glu Thr Thr Thr Arg His Lys Val Leu lie Met Glu Phe Cys Pro 
'80 85 90 

Cvs Gly Ser Leu Tyr Thr Val Leu Glu Glu Pro' Ser Asn Ala Tyr 
95 : 100 ■ 105 

Gly Leu Pro Glu Ser Glu Phe Leu lie Val Leu Arg Asp Val Val 
110 t 115 120 

Gly Gly Met Asn His Leu Arg Glu Asn Gly lie Val His Arg Asp 
125 ' ■ 130 135 

He Lys Pro Gly Asn He Met Arg Val He Gly Glu Asp Gly Gin 
i 140 145 150 

Ser Val Tyr Lys Leu Thr Asp Phe Gly Ala Ala Arg Glu Leu Glu 
155 160 165 

Asp Asp Glu Gin Phe Val Ser Leu Tyr Gly Thr Glu Glu Tyr Leu 
170 ' 175 180 

His Pro Asp Met Tyr Glu Arg Ala Val Leu Arg Lys Asp His Gin 
185 190 K 195 

Lys Lys Tyr Gly Ala Thr Val Asp Leu Trp Ser He Gly Val Thr 
200 205 210 

Phe Tyr His Ala Ala Thr Gly Ser Leu Pro Phe Arg Pro Phe Glu 
t 215 220 , 225 

Gly Pro Arg Arg Asn Lys Glu Val Met Tyr Lys He He Thr Gly 
230 235 240 

Lys Pro Ser Gly Ala He Ser Gly Val Gin Lys Ala Glu Asn Gly 
245 250 255 

Pro He Asp Trp Ser Gly Asp Met Pro Val Ser Cys Ser Leu Ser 
260 , 265 ( , 270 

Arg Gly Leu Gin Val Leu Leu Thr Pro Val Leu Ala Asn He Leu 
275 280 285 

Glu Ala Asp Gin Glu Lys Cys Trp Gly Phe Asp Gin Phe Phe Ala 
290 295 300 

Glu Thr Ser Asp He Leu His Arg Met Val He His Val Phe Ser 
305 310 315 

Leu Gin Gin Met Thr Ala His Lys He Tyr He His Ser Tyr Asn 
320 325 330 

Thr Ala Thr He Phe His Glu Leu Val Tyr Lys Gin Thr Lys He 
335 340 345 

He Ser Ser Asn Gin Glu Leu He Tyr Glu Gly Arg Arg Leu Val 
350 355 360 

Leu Glu Pro Gly Arg Leu Ala Gin His Phe Pro Lys Thr Thr Glu 
365 370 375 

Glu Asn Pro He Phe Val Val Ser Arg Glu Pro Leu Asn Thr He 
380 385 390 

Gly Leu He Tyr Glu Lys He Ser Leu Pro Lys Val His Pro Arg 
395 400 405 

Tyr Asp Leu Asp Gly Asp Ala Ser Met Ala Lys Ala He Thr Gly 
410 415 420 

Val Val Cys Tyr Ala Cys Arg He Ala Ser Thr Leu Leu Leu Tyr 
425 430 435 

Gin Glu Leu Met Arg Lys Gly He Arg Trp Leu He Glu Leu He 
440 445 450 

Lys Asp Asp Tyr Asn Glu Thr Val His Lys Lys Thr Glu Val Val 
455 460 465 

He Thr Leu Asp Phe Cys He Arg Asn He Glu Lys Thr Val Lys 
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65 70 75 

Ser Phe Leu Tyr Thr Val Asn Tyr He Trp Tyr Leu Tyr Thr Glu 
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Asn 


Ser 


Phe 


Lys 


Arg 


Met 


Asp Asp 


Glu 


Asp 


Lys 


Gin 


Glu 


Thr 










35 










40 










45 


Gin 


Ser 


Pro 


Thr 


Met 
50 


Ser 


Pro 


Leu 


Ala 


Ser 
55 


Pro 


Pro 


Ser 


Ser 


Pro 
60 


Pro 


His 


Tyr 


Gin 


Arg 


Val 


Pro 


Leu 


Ser 


His 


Gly 


Tyr 


Ser 


Lys 


Leu 










65 










70 






75 


Arg 


Ser 


Ser 


Ala 


Glu 
80 


Gin 


Met 


His 


Pro 


Ala 
85 


Pro 


Tyr 


Glu 


Ala 


Arg 
90 


Gin 


Pro 


Leu 


Val 


Gin 


Pro 


Glu 


Gly 


Ser 


Ser 


Ser 


Gly Gly 


Pro 


Gly 










95 










100 










105 


Thr 


Lys 


Pro 


Leu 


Arg 


His 


Gin 


Ala 


Ser 


Leu 


He 


Arg 


Ser 


Phe 


Ser 










110 










115 








120 


Val 


Glu 


Arg 


Glu 


Leu 


Gin 


Asp Asn 


Ser 


Ser 


Tyr 


Pro 


Asp 


Glu 


Pro 










125 










130 






135 


Trp Arg 


He 


Thr 


Glu 


Glu 


Gin Arg 


Glu 


Tyr 


Tyr 


Val 


Asn 


Gin 


Phe 










140 










145 










150 


Arg 


Ser 


Leu 


Gin 


Pro Asp 


Pro 


Ser 


Ser 


Phe 


He 


Ser 


Gly Ser Val 










155 










160 










165 


Ala 


Lys 


Asn 


Phe 


Phe 
170 


Thr 


Lys 


Ser 


Lys 


Leu 
175 


Ser 


He 


Pro 


Glu 


Leu 
180 


Ser Tyr 


He 


Trp 


Glu 


Leu 


Ser Asp Ala Asp Cys 


Asp 


Gly Ala 


Leu 










185 










190 








195 


Thr 


Leu 


Pro 


Glu 


Phe 
200 


Cys 


Ala 


Ala 


Phe 


His 
205 


Leu 


He 


Val 


Ala 


Arg 
210 


Lys Asn 


Gly Tyr 


Pro 


Leu 


Pro Glu Gly Leu. 


Pro 


Pro 


Thr 


Leu 


Gin 
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' 215 220 225 

Pro Glu Tyr Leu Gin Ala Ala Phe Pro Lys Pro Lys Trp Asp Cys 
230 235 ' 240 

Gin Leu Phe Asp Ser Tyr Ser Glu Ser Leu Pro Ala Asn Gin Gin 
245 250 f 255 

Pro Arg Asp Leu Asn Arg Met Glu Thr Ser 'Val Lys Asp Met Ala 
260 265 270 

Asp Leu Pro Val Pro Asn Gin Asp Val Thr Ser Asp Asp Lys Gin 
275 280 285 

Ala Leu Lys Ser Thr lie Asn Glu Ala Leu Pro Lys Asp Val Ser 
290 1 295 300 

Glu Asp Pro Ala Thr Pro Lys Asp Ser Asn Ser Leu Lys Ala Arg 
Q05 310 ' 315 

Pro Arg Ser Arg Ser Tyr Ser Ser Thr Ser lie 1 Glu Glu Ala Met 
320 : 325 330 

Lys Arg Gly Glu Asp Pro Pro Thr Pro Pro Pro Arg Pro Gin Lys 
y 335 . 340 345 

Thr His Ser Arg Ala Ser Ser Leu Asp Leu Asn Lys Val Phe Gin 
350 ' . 355 360 

Pro Ser Val Pro Ala Thr Lys Ser Gly Leu Leu Pro Pro Pro Pro 
, 365 370 375 

Ala Leu Pro Pro Arg Pro Cys Pro Ser Gin Ser Glu Gin Val Ser 
380 385 390 

Glu Ala Glu Leu Leu Pro Gin Leu Ser Arg Ala Pro Ser Gin Ala 
395 1 400 405 

Ala Glu Ser Ser Pro Ala Lys Lys Asp Val Leu Tyr Ser Gin Pro 
410 415 v 420 

Pro Ser Lys Pro lie Arg Arg Lys Phe Arg Pro Glu Asn Gin Ala 
425 430 , 435 

Thr Glu Asn Gin Glu Pro Ser Thr Ala Ala Ser Gly Pro Ala Ser 
I 440 445 ( 450 

Ala Ala Thr Met Lys Pro His Pro Thr Val Gin Lys Gin Ser Ser 
455 460 465 

Lys Gin Lys Lys Ala He Gin Thr Ala He Arg Lys Asn Lys Glu 
470 475 480 

Ala Asn Ala Val Leu Ala Arg Leu Asn Ser Glu Leu Gin Gin Gin 
485 , 490 t 495 

Leu Lys Glu Val His Gin Giu Arg He Ala Leu Glu Asn Gin Leu 
500 505 510 

Glu Gin Leu Arg Pro Val Thr Val Leu 
515 



<210> 11 

<211> 334 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 1710702 

<400> 11 

Met Phe Arg Trp Glu Arg Ser He Pro Leu Arg Gly Ser Ala Ala 

15 10 15 

Ala Leu Cys Asn Asn Leu Ser Val Leu Gin Leu Pro Ala Arg Asn 

20 25 30 

Leu Thr Tyr Phe Gly Val Val His Gly Pro Ser Ala Gin Leu Leu 

35 40 45 

Ser Ala Ala Pro Glu Gly Val Pro Leu Ala Gin Arg Gin Leu His 

50 55 60 
Ala Lys Glu Gly Ala Gly Val Ser Pro Pro Leu He Thr Gin Val 



14/26 



WO 99/58558 



PCT/US99/10567 









65 






70 




75 


His 


Trp 


Cys Val 


Leu 


Pro 


Phe Arg Val 


Leu 


Leu' Val Leu Thr 


Ser 








80 






85 




90 


His 


Arg 


Gly He 


Gin 


Met 


Tyr Glu Ser Asn Gly Tyr Thr Met Val 








95 






100 




105 


Tyr 


Trp 


His Ala 


Leu 


Asp 


Ser Gly Asp Ala 


Ser Pro Val Gin 


Ala 








110 






115 




120 


Val 


Phe 


Ala Arg 


Gly 


He 


Ala Ala Ser 


Gly His Phe He Cys 


Val 








125 






130 




135 


Gly 


Thr 


Trp Ser 


Gly Arg 


Val Leu Val 


Phe 


Asp He Pro Ala 


Lys 








140 






145. 




150 


Gly 


Pro 


Asn' He 


iVal 


Leu 


Ser Glu Glu 


Leu 


Ala Gly His Gin 


Met 








155 






160 


i 


165 


Pro 


He 


Thr Asp 


He 


Ala 


Thr Glu Pro 


Ala 


Gin Gly Gin Asp 


Cys 








170 






175 




180 


Val 


Ala 


Asp Met 


Val 


T?hr 


Ala Asp Asp 


Ser Gly Leu Leu Cys 


Val 






Ser Gly 


185 


Glu 




190 




195 


Trp 


Arg 


Pro 


Phe Thr p Leu 


Leu 


Thr Arg He Pro 


Gly 








200 






205 




210 


Phe 


Gly 


Val~Pr t o 


Cys 


Pro 


Ser Val Gin 


Leu Trp Gin Gly lie 


He 








215 






226 




225 


Ala 


Ala 


Gly Tyr 


Gly Asn 


Gly Gin Val 


His 


Leu Tyr Glu Ala 


Thr 








230 


i 




235 




240 


Thr 


Gly 


Asn Leu 


His 


Val 


Gin He Asn 


Ala 


His Ala Arg Ala 


He 








245 






250 


255 


Cys 


Ala 


Leu Asp 


Leu 


Ala 


Ser Glu Val 


Gly 


Lys Leu Leu-*Ser 


Ala 








260 






265 




270 


Gly 


Glu 


Asp Thr 


Phe 


Val 


His He Trp 


Lys 


Leu Ser Arg Asn 


Pro 








275 






280 




285 


Glu 


Ser 


Gly Tyr 


He 


Glu 


Val Glu His 


Cys 


His Gly Glu Cys 


Val 








290 






295 




300 


Ala 


Asp 


Thr Gin 


Leu 


Cys 


Gly Ala Arg 


Phe 


Cys Asp Ser Ser 


Gly 








305 






310 


315 


Asn 


Ser 


Phe Ala 


Val 


Thr 


Gly Tyr Asp 


Leu 


Ala Glu He Arg Arg 








320 






325 




330 


Phe 


Ser 


Ser Val 










i 





I 



! 

<210> 12 
<211> 569 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc-feature 

<223> Incyte Clone 323914 9 

<400> 12 



Met Asp 


Pro 


Ala 


Glu 


Ala 


Val 


Leu 


Gin 


Glu 


Lys 


Ala 


Leu 


Lys 


Phe 


1 






5 










10 






15 


Met Asn 


Ser 


Ser 


Glu 
20 


Arg 


Glu 


Asp 


Cys 


Asn 
25 


Asn 


Gly 


Glu 


Pro 


Pro 
30 


Arg Lys 


He 


He 


Pro 


Glu 


Lys 


Asn 


Ser 


Leu Arg 


Gin 


Thr Tyr Asn 








35 










40 










45 


Ser Cys 


Ala 


Arg 


Leu 


Cys 


Leu 


Asn 


Gin 


Glu 


Thr 


Val 


Cys 


Leu 


Ala 








50 










55 








60 


Ser Thr 


Ala 


Met 


Lys 
65 


Thr 


Glu 


Asn 


Cys 


Val 
70 


Ala 


Lys 


Thr 


Lys 


Leu 
75 


Ala Asn 


Gly Thr 


Ser 


Ser 


Met 


He 


Val 


Pro 


Lys 


Gin 


Arg 


Lys 


Leu 








80 










85 










90 


Ser Ala 


Ser 


Tyr 


Glu 
95 


Lys 


Glu 


Lys 


Glu 


Leu 
100 


Cys 


Val 


Lys 


Tyr 


Phe 
105 
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Glu 


Gin 


Trp 


Ser 


Glu 


Ser Asp 


Gin 


vaj. 


biU 


fne 


Vdl 


Glu His 


Leu 








110 










llo 








120 


He 


Ser 


Gin 


Met 


Cys 
125 


His 


Tyr 


Gin 


His 


Gly 
130 


nlS 


xxe 


Asn Ser 


Tyr 
135 


Leu 


Lys 


Pro 


Met 


Leu 


Gin 


Arg 


Asp 


Phe 


He 


Thr 


Ala 


Leu Pro 


Ala 








140 










145 








150 


Arg 


Gly 


Leu 


Asp 


His 


He 


Ala 


Glu 


Asn 


He 


Leu 


Ser 


Tyr Leu Asp 




155 










160 








165 


Ala 


Lys 


Ser 


Leu 


Cys 


Ala 


Ala 


Glu 


Leu 


Val 


Cys 


Lys 


Glu Trp 


Tyr 








170 








i 


175 








180 


Arg 


Val 


Thr 


Ser Asp Gly Met 


Leu Trp ( Lys 


Lys 


Leu 


He Glu 


Arg 








185 










190 








195 


Met Val Arg Thr Asp Ser Leu Trp Arg Gly Leu Ala Glu Arg Arg 










200 










205 








210 


Gly Trp 


Gly 


Gin 


Tyr 


Leu 


Phe 


Lys 


Asn 


Lys 


Pro 


Pro Asp Gly Asn 








215 










220 








225 


Ala 


Pro 


Pro 


Asn 


Ser 


Phe 


Tyr Arg 


Ala 


Leu 


Tyr 


Pro 


Lys He 


He 










230 










235 








240 


Gin Asp 


He 


Glu 


Thr 


He 


Glu 


Ser 


Asn 


Trp Arg 


Cys 


Gly Arg 


His 










245 










250 








255 


Ser 


Leu 


Gin 


Arg 


He 


His 


Cys 


Arg 


Ser 


•31u 


Thr 


Ser 


Lys Gly Val 








260 








265 








270 


Tyr 


Cys 


Leu 


Gin 


Tyr 


Asp 


Asp 


Gin 


Lys 


He 


Val 


Ser 


Gly Leu 


Arg 








275 










280 








285 


Asp 


Asn 


Thr 


He 


Lys 


lie 


Trp 


Asp 


Lys 


Asn 


Thr 


Leu 


Glu Cys 


l Lys 








290 










295 








300 


Arg 


He 


Leu 


Thr 


Gly 


His 


Thr 


Gly 


Ser 


Val 


Leu 


Cys 


Leu Gin 


Tyr 








305 










310 








315 


Asp 


Glu 


Arg 


Val' 


He 


He 


Thr 


Gly 


Ser 


Ser 


Asp 


Ser 


Thr Val 


Arg 






320 










325 








330 


Val 


Trp Asp 


Val 


Asn 


Thr 


Gly 


Glu 


Met 


Leu 


Asn 


Thr 


Leu He 


His 










335 










340 








345 


His 


Cys 


Glu 


Ala 


Val 


Leu 


His 


Leu 


Arg 


Phe 


Asn 


Asn 


Gly Met 


Met 








350 










355 








360 


Val 


Thr 


Cys 


Ser 


Lys 


Asp Arg 


Ser 


He 


Ala 


Val 


Trp 


Asp Met 


Ala 








365 










370 








375 


Ser 


Pro 


Thr 


Asp 


He 


Thr 


Leu Arg Arg 


Val 


Leu 


Val 


Gly His 


Arg 








380 




! 






385 








390 


Ala 


Ala 


Val 


Asn 


Val 


Val 


Asp 


Phe 


Asp 


Asp 


Lys 


Tyr 


He Val 


Ser 










395 








400 








405 


Ala 


Ser 


Gly Asp Arg 


Thr 


He 


Lys 


Val 


Trp 


Asn 


Thr 


Ser Thr 


Cys 










410 










415 








420 


Glu 


Phe 


Val 


Arg 


Thr 


Leu 


Asn 


Gly 


His 


Lys 


Arg 


Gly 


He Ala 


Cys 








425 










430 








435 


Leu 


Gin 


Tyr 


Arg 


Asp Arg 


Leu 


Val 


Val 


Ser 


Gly 


Ser 


Ser Asp 


Asn 






440 










445 








450 


Thr 


He 


Arg 


Leu 


Trp Asp 


He 


Glu 


Cys 


Gly Ala 


Cys 


Leu Arg 


Val 








455 










4 60 








465 


Leu 


Glu 


Gly 


His 


Glu 


Glu 


Leu 


Val 


Arg 


Cys 


He 


Arg 


Phe Asp 


Asn 








470 










475 








480 


Lys 


Arg 


He 


Val 


Ser Gly Ala 


Tyr Asp 


Gly 


Lys 


He 


Lys Val 


Trp 






485 










490 








495 


Asp 


Leu 


Val 


Ala 


Ala 


Leu 


Asp 


Pro Arg 


Ala 


Pro 


Ala 


Gly Thr 


Leu 








500 








505 








510 


Cys 


Leu 


Arg 


Thr 


Leu 


Val 


Glu 


His 


Ser 


Gly Arg 


Val 


Phe Arg 


Leu 






515 










520 








525 


Gin 


Phe 


Asp 


Glu 


Phe 


Gin 


He 


Val 


Ser 


Ser 


Ser 


His 


Asp Asp 


Thr 








530 










535 








540 


He 


Leu 


He 


Trp 


Asp 


Phe 


Leu 


Asn 


Asp 


Pro 


Ala 


Ala 


Gin Ala 


Glu 








545 










550 








555 


Pro 


Pro 


Arg 


Ser 


Pro 


Ser 


Arg 


Thr 


Tyr 


Thr 


Tyr 


He 


Ser Arg 










560 










565 
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<210> 13. 
<211> 123 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc-feature 

<223> Incyte Clone 3315936 

i 

<400> 13 



Met 


Glu 


Phe 


Leu 


Glu 


He 


Gly Gly 


Ser 


Lys 


Pro 


Phe 


Arg 


Ser 


Tyr 


1 








5 










10 










15 


Trp 


Glu 


Met 


Tyr 


Leu 


Ser 


Lys 


Gly 


Leu 


Leu 


Leu 


He 


Phe 


Val 


Val 










20 










25 










30 


Asp 


Ser 


Ala 


Asp 


His 


Ser 


Arg 


Leu 


Pro 


Glu 


Ala 


Lys 


Lys 


Tyr 


Leu 










35 










40 










45 


His 


Gin 


Leu 


He 


Ala 


Ala 


Asn 


Pro 


Val 


Leu 


Pro 


Leu 


Val 


Val 


Phe 










50 










55 










60 


Ala 


Asn 


Lys 


Glr> 


Asp 


Leu 


Glu 


Ala 


Ala 


Tyr 


His 


He 


Thr Asp 


He 










65 










70" 










75 


His 


Glu 


Ala 


Leu 


Ala 


Leu 


Ser 


Glu 


Val 


Gly Asn 


Asp 


Arg 


Lys 


Met 










80 


i 








85 










90 


Phe 


Leu 


Phe 


Gly 


Thr 


Tyr 


Leu 


Thr 


Lys 


Asn 


Gly 


Ser 


Glu 


He 


Pro 










95 










100 










105 


Ser 


Thr 


Met 


Gin 


Asp 


Ala 


Lys 


Asp 


Leu 


He 


Ala 


Gin 


Leu 


Ala 


Ala 










110 










115 










120 


Asp 


Val 


Gin 



























i 



<210> 14 
<211> 1957 
<212> DNA 

<213> Homo sapiens ' , 

<220> 

<221> misc-feature ' 
<223> Incyte Clone 016108 

<400> 14 

atttttgtca ctttctgtgt gaactaaagt gattcaatgt ctcttttgga ttgcttctgt 60 

acttcaagaa cacaagttga atcactcaga cctgaaaaac agtctgaaac cagtatccat 120 

caatacttgg ttgatgagcc aaccctttcc tggtcacgtc catccactag agccagtgaa 180 

gtactatgtt ccaccaacgt ttctcactat gagctccaag tagaaatagg aagaggattt 240 

gacaacttga cttctgtcca tcttgcacgg catactccca caggaacact ggtaactata 300 

aaaattacaa atctggaaaa ctgcaatgaa gaacgcctga aagctttaca gaaagccgtg 360 

attctatccc actttttccg gcatcccaat attacaactt attggacagt tttcactgtt 420 

ggcagctggc tttgggttat ttctccattt atggcctatg gttcagcaag tcaactcttg 480 

aggacctatt tccctgaagg aatgagtgaa actttaataa gaaacattct ctttggagcc 540 

gtgagagggt tgaactatct gcaccaaaat ggctgtattc acaggagtat taaagccagc 600 

catatcctca tttctggtga tggcctagtg accctctctg gcctgtccca tctgcatagt 660 

ttggttaagc atggacagag gcatagggct gtgtatgatt tcccacagtt cagcacatca 720 

gtgcagccgt ggttgagtcc agaactactg agacaggatt tacatgggtt atatgtgaag 780 

tcagatattt acagtgttgg gatcacagca tgtgaattag ccagtgggca ggtgcctttc 840 

caggacatgc atagaactca gatgctgtta cagaaactga aaggtcctcc ttatagccca 900 

ttggatatca gtattttccc tcaatcagaa tccagaatga aaaattccca gtcaggtgta 960 

gactctggga ttggagaaag tgtgcttgtc tccagtggaa ctcacacagt aaatagtgac 1020 

cgattacaca caccatcctc aaaaactttc tctcctgcct tctttagctt ggtacagctc 1080 

tgtttgcaac aagatcctga gaaaaggcca tcagcaagca gtttattgtc ccatgttttc 1140 

ttcaaacaga tgaaagaaga aagccaggat tcaatacttt cactgttgcc tcctgcttat 1200 

aacaagccat caatatcatt gcctccagtg ttaccttgga ctgagccaga atgtgatttt 1260 

cctgatgaaa aagactcata ctgggaattc tagggctgcc aaatcatttt atgtcctata 1320 
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tacttgacac tttctccttg ctgctttttc 
tatacttgaa aatacagttg gtgcactgga, 
aggggcacca gtttgtagtc cctctgtttc 
aattactaat ctagctagtg tcatttattc 
tttttatctc tcaatataat ttttgagcca 
tgggaatggg ccctcagagg acagtgcttc 
tttttaatga gctattgtta aaccaacagg 
gtagagggaa aatgtttgtg ctttcccttt 
actgagcctc actcacatta aatgattcac 
tttttttaaa aaagggggct aaagtaacac 
attcacgcac cccgtgggag ctcaataaag 



ttctgtattt ctaggtacaa ataccagaat 1380 
gaatctatta tttaaaacca ctctgttcaa 1440 
gcacagagta ctatgacaag gaaacatcag 1500 
tggaattttt ttctaagctg tgactaactc 1560 
gttaattttt ttcagtattt tgctgtccct 1620 
caagtacatc ttctcccaga ttctctggcc 1680 
ctagtttatc ttacatcaga cccttttctg 1740 
ttcttctgtt aatacttatg gtaacaccta 1800 
ttgaaatata tacagaaatt gtaatttgct 1860 
tttcctactt atgtaaatta tagatcctaa 1920 
atttact 1957 



<210> 15 

<211> 2545 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc-feature 

<223> Incyte Clone 640521 



<400> 15 

gagccgagct gggggcgcag agcgcgggag 
ctgcagtgca gtggtgcgat ctcggctcag 
tgattctcct gcctcagcct cccgagtagc 
ggtggctccg ctgccggatg ggagtgcccc 
atgtcatcat gtgtctctag ccagcccagc 
gggggcaggg gcagcagcag cagcgaaagc 
tcatccttga gcatccacct gggcatggag 
ggctgtgctg tggacctcgg cttggcgcgg 
gtcccccttg actcctccgg gtcccaggcc 
ctgcaagagc ggtcccaggg tgggctggca 
tgcatctgcc cgtccctgcc ctactcaccc 
ccccggcggc cgacagtgga gtctcaccac 
cagctgaatc agtataccct gaaggatgaa 
ttggcctaca atgaaaatga caatacctac 
ctgatccggc aggccggctt tccacgtcgc 
ggaggctgca tccagcccag gggccccatt 
aagaagctgg accaccccaa tgtggtgaag 
gaccatctgt acatggtgtt cgaactggtc 
ctcaaaccac tctctgaaga ccaggcccgt 
gagtacttac actaccagaa gatcatccac 
ggagaagatg ggcacatcaa gatcgctgac 
gacgcgctcc tctccaacac cgtgggcacg 
gagacccgca agatcttctc tgggaaggcc 
tactgctttg tctttggcca gtgcccattc 
aagatcaaga gtcaggccct ggaatttcca 
gacctgatca cccgtatgct ggacaagaac 
aagctgcacc cctgggtcac gaggcatggg 
tgcacgctgg tcgaagtgac tgaagaggag 
ttggcaaccg tgatcctggt gaagaccatg 
gagggcagcc ggcgggagga acgctcactg 
ggcagcgaag acaacctcca gggcaccgac 
ttgtgagagg cagtccctgc gtggaaagtt 
gcatgcatcc actgcggccg gaggaggcca 
acctcgcatg cgcgccgcgt cgcctctggg 
tgtcctacgg aaacccagca cgtgtgtaga 
tttcctttgt tgttttaaag gggacaaaaa 
tcgaccgtgg ccgctggctg gctggacagg 
cgtgcatttt gggacaattg ctttttaaaa 
ttttcagaac cacgtcagat ataccaagtg 
ggcgagcaga aaactccggc ggtctgaggc 
gtagggggct agatgtggct cctagtgcaa 



gcggcggcgg cgcggaccca gtcacccagg 60 
tcattgcaac cttcacctcc c'ggattcaag 120 
tgggattaca ggtgcccacc accatgccca 180 
agtgtgctgg atgaagctgg cgcatgcacc 240 
agcaaccggg ccgcccccca ggatgagctg 300 
cagaagccct gtgaggccct gcggggcctc 360 
tccttcattg tggtcaccga gtgtgagccg l 420 
gaccggcccc tggaggccga tggccaagag 480 
cggccccacc tctccggtcg caagctgtct 540 
gccggtggca gcctggacat gaacggacgc 600 
gtcagctccc cgcagtcctc gcctcggctg 660 
gtctccatca cgggtatgca ggactgtgtg 720 
attggaaagg gctcctatgg tgtcgtcaag 780 
tatgcaatga aggtgctgtc caaaaagaag 840 
cctccacccc gaggcacccg gccagctcct 900 
gagcaggtgt accaggaaat tgccatcctc 960 
ctggtggagg tcctggatga ccccaatgag 1020 
aaccaagggc ccgtgatgga agtgcccacc 1080 
ttctacttcc aggatctgat caaaggcatc 1140 
cgtgacatca aaccttccaa cctcctggtc 1200 
tttggtgtga gcaatgaatt caagggcagt 1260 
cccgccttca tggcacccga gtcgctctct 1320 
ttggatgttt gggctatggg tgtgacacta 1380 
atggacgagc ggatcatgtg tttacacagt 1440 
gaccagcccg acatagctga ggacttgaag 1500 
cccgagtcga ggatcgtggt gccggaaatc 1560 
gcggagccgt tgccgtcgga ggatgagaac 1620 
gtcgagaact cagtcaaaca cattcccagc 1680 
atacgtaaac gctcctttgg gaacccattc 1740 
tcagcgcctg gaaacttgct cacgaagcaa 1800 
ccgccccccg tgggggagga ggaagtgctc 1860 
gctgggcccc cgcccccggc tcccccgcac 1920 
tggagcccga gtagctgcct ggatcgctcg 1980 
gggctgctgc accgcgtttc catagcagca 2040 
gcctcgatcg tcatctctgg ttatttgttt 2100 
aaaaaaaaaa aaggacttga ctccatgacg 2160 
cgggtgtgag gagttgcaga cccaaaccca 2220 
cgtttttatg ccaaaaatcc ttcattgtga 2280 
actgtgtgtg gggtttgaca actgtggaaa 2340 
catggaggtg gttgctgcat ttgagaggga 2400 
accggaaacc atggcacctt ccagagccgt 24 60 
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ggtctcaagg agteagagca gggctggccc tcagtagctg cagggagctt tgatggcaac 2520 
ttattttgtt aagaagggtt ttttt 1 2545 



<210> 16 

<211> 3034 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc- feature 

<223> Incyte Clone 1250171 



<400> 16 

tcctgagtct 

gtataacaag 

ttttatctga 

aaactggtga 

atgttcaaat 

tatttgctat 

catgtgggag 

ctgaattctt 

gtatagtgca- 

agtctgtgta 

ttgtttctct 

taagaaaaga 

cattttacca 

ataaagaagt 

agaaagcaga 

ctcggggtct 

aaaagtgttg 

taattcatgt 

atactgctac 

aagaacttat 

tccctaaaac 

taggattaat 

gggatgctag 

gtaccttact 

ttaaagatga 

tctgtatcag 

tggaagcggc 

gttctcaggg 

gatcactggc 

aaaaactaca 

aagcagaacg 

tgtattacca 

aggcattttt 

tattatcgct 

atactaatga 

aacataccat 

tgaagaaatt 

ttttagaaag 

agctttctaa 

ttaaatgaat 

tgtaaatatg 

gatgtaattt 

gagaagaaag 

taaataagtt 

catccttaca 

tgtgagtggg 

ataaagtggc 

taatttcaga 

atttttttcc 

gttttaagct 



cgaggaggcc 
aggattgcct 
tattttaggc 
tttatttgct 
gagagaattt 
tgaagaggag 
tttatacact 
aattgttttg 
ccgtgatatc 
caaactcaca 
gtatggcaca 
tcatcagaag 
tgcagctact 
gatgtataaa 
aaatggacca 
tcaggttcta 
gggttttgac 
tttttcgcta 
tatatttcat 
ctacgaaggg 
tactgaggaa 
atatgaaaaa 
catggctaag 
gctttatcag 
ttacaatgaa 
aaacattgaa 
agagttaggt 
aacaatagaa 
agacgcatgg 
agtcctgtta 
tagattagct 
tgccacaaaa 
gaataagtca 
gactaatcag 
gttacaagaa 
gaccccaatt 
aaaggaagag 
gtttggctct 
tagaagttta 
gcctttatag 
tacaatattg 
tatcttttaa 
ccatgaccga 
attttctctg 
aatcaggaag 
gagcagaacc 
attattttat 
tgaattatta 
tttggccata 
gtatatttct 



gcgggagccc 
gatccagcca 
caaggagc'ta 
atcaaagtat 
gaagtgttga 
acaacaacaa 
gttttagaag 
cgagatgtgg 
aagccaggaa 
gattttggtg 
gaagaatatt 
aaatatggag 
ggatcactgc 
ataattacag 
attgactgga 
cttacccctg 
cagttttttg 
caacaaatga 
gaactggtat 
cgacgcttag 
aaccctatat 
atttccctcc 
gcaataacag 
gaattaatgc 
actgttcaca 
aaaactgtga 
gaaatttcag 
accagtcttc 
gcacatcaag 
aattgcatga 
tataatgaag 
gctatgacgc 
gaagaatgga 
tgttttgata 
actctgcctc 
tatccaagtt 
atggaagggg 
ttaaccatgg 
agaaaagttt 
atagtcactt 
taaatacata 
catttataat 
ccaatatgtt 
accgcctact 
actgacttga 
gcaccactgt 
tttacaaggt 
agcaaacatt 
aatgtgtaat 
ttaattctgc 



gccggcggtg 
agatgcagag 
ctgcaaacgt 
ttaataacat 
aaaaactcaa 
gacataaagt 
aaccttctaa 
tgggtggaat 
atatcatgcg 
cagctagaga 
tgcaccctga 
caacagttga 
catttagacc 
gaaagccttc 
gtggagacat 
ttcttgcaaa 
cagaaactag 
cagctcataa 
ataaacaaac 
tcttagaacc 
ttgtagtaag 
ctaaagtaca 
gggttgtgtg 
gaaaggggat 
aaaagacaga 
aagtatatga 
acatacacac 
aggatatcga 
aaggcactca 
cagagattta 
aacaaatcca 
actttacaga 
taagaaagat 
ttgaagaaga 
agaaaatgtt 
ctaacacatt 
tggttaaaga 
atggtggcct 
ccgtttgcac 
gtttctacaa 
aaaaatatac 
tatatgagga 
gacatactga 
ggaaatattt 
cacgtttgta 
tatactggga 
gcccagatcc 
ttaaagtgaa 
tgtcattaaa 
ttactatttc 



gcgcggcgga 
cacttctaat 
ctttcgtgga 
aagcttcctt 
tcacaaaaat 
acttattatg 
tgcctatgga 
gaatcatcta 
tgttataggg 
attagaagat 
tatgtatgag 
tctttggagc 
ctttgaaggg 
tggtgcaata 
gcctgtttct 
catccttgaa 
tgatatactt 
gatttatata 
caaaattatt 
tggaaggctg 
ccgggaacct 
tccacgttat 
ttatgcctgc 
acgatggctg 
agttgtgatc 
aaagttgatg 
caaattgttg 
cagcagatta 
tccgaaagac 
ctatcagttc 
caaatttgat 
tgaatgtgtt 
gcttcatctt 
agtatcaaaa 
tacagcttcc 
agtagaaatg 
acttgctgaa 
tcgcaacgtt 
aagaaaataa 
ttcagtattt 
aaatttttgg 
aatttgacct 
tcctctactc 
ttaagtggaa 
aatggtagaa 
taacaatttt 
cagttatcct 
ttcattatta 
attctaaggt 
atggaaaaaa 



gacccggctg 
catctgtggc 
agacataaga 
cgtccagtgg 
attgtcaaat 
gaattttgtc 
ctaccagaat 
cgagagaatg 
gaagatggac 
gatgagcagt 
agagcagtgc 
attggggtaa 
cctcgtagga 
tctggagtac 
tgcagtcttt 
gcagatcagg 
caccgaatgg 
catagctata 
tcttcaaatc 
gcacaacatt 
ctgaatacca 
gatttagacg 
agaattgcca 
attgaattaa 
acattggatt 
aagatcaacc 
agactttcca 
tctccaggtg 
agaaatgtag 
aaaaaagaca 
aagcaaaaac 
aaaaagtatg 
aggaaacagt 
tatcaagaat 
agtggaatca 
actcttggta 
aataaccaca 
gactgtcttt 
cgcttgggca 
gatgtggtcg 
ctgctgtgaa 
cagtgatcac 
tgagtggggc 
ccaaaatagg 
cggtggctac 
tttgagaagg 
tgtatccatg 
aaaactattc 
catttcaact 
ataaatttct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
i900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
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caattttaaa aaattttttt ataaaaaaaa aaaa 3034 



<210> 17 

<211> 1337 f , 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 1911587 

<400> 17 ' 1 ' ■ 

gaaagctgtg ggaccatcct ggcaaccccg gtgtttggct gggttctagc gtaccgtctg 60 

tgtggccggt gggggacctg cggtcggagt gggagggcca gtctgcaccc aagaggtgga 120 

agaggacggg ctttaggctg gaagcgcctt agaggagcca tttttccagg tggggcccca 180 

ggcagaggct ccgacaggga' gcctggccat agtcgcgcac caggggaggt ggagcgcgtc 240 

ccagacccga gcccccgacc tcagccaaac ccattccttc tgtccttgga ggccagaggg 300 

gactctgagc atcggaaagc aggatgcctg gtttgctttt atgtgaacca acagagcttt 360 

acaacatcct gaatcaggcc acaaaactct ccagattaac agaccccaac tatctctgtt 420 

tattggatgt ccgttccaaa tgggagtatg acgaaagcca tgtgatcact gcccttcgag 480 , 

tgaagaagaa aaataatgaa tatcttctcc cggagtctgt ggacctggag tgtgtgaagt 540 

actgcgtggt gtatgataac aacagcagca ccctggagat actcttaaaa gatgatgatg 600 

atgattcaga ctctgatggt gatggcaaag atcttgtgcc tcaagcagcc attgagtatg 660 

gcaggatcct gacccgcctc acccaccacc ccgtctacat cctgaaaggg ggctatgagc 720 

gcttctcagg cacgtaccac tttctccgga cccagaagat catctggatg cctcaggaac 780 

tggatgcatt tcagccatac cccattgaaa tcgtgccagg gaaggtcttc gttggcaatt 840 \ 

tcagtcaagc ctgtgacccc aagattcaga aggacttgaa aatcaaagcc catgtcaatg 900 

tctccatgga tacagggccc ttttttgcag gcgatgctga caggcttctg cacatccgga 960 

tagaagattc cccggaagcc cagattcttc ccttcttacg ccacatgtgt cacttcattg 1020 

aaattcacca teaccttggc tctgtcattc tgatcttttc cacccagggt atcagccgca 1080 

gttgtgccgc catcatagcc tacctcatgc atagtaacga gcagaccttg cagaggtcct 1140 

gggcctatgt caagaagtgc aaaaacaaca tgtgtccaaa tcggggattg gtgagccagc 1200 

tgctggaatg ggagaagact atccttggag attccatcac aaacatcatg gatccgctct 1260 

actgatcttc tccgaggccc accgaagggt actgaagagc ctcacctggg ggcattttgt 1320 

gggtggaggg ccagagt , 1337 



<210> 18 

<211> 1639 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 2079081 

<40O> 18 

gacaaaagcc agacacattt caacatgagg 
gtatacaaga acctaaagga gttttctcaa 
tctgttcttc agcaaagggc aaacctggaa 
gcaagcaagc tgagcaaagc attacagaac 
gcctgggcct cagagggaat gaaatccaca 
attgaattgg aagcaataaa accgacttat 
aaatcacttg acaatgaagt tgaaaagaca 
caaattaagg ccaagaagaa attaatggtt 
cttgtagaaa gctccaagca atctatgact 
ctgacaaaat caactgaaaa gttggaaaag 
gcgggttatt ctacqagact gaaatgggaa 
ctggagctgg agaaggaaag aattcaactt 
catatttctc tttttggcca aaccctgacc 
agcaagattg acattgaaaa agatatccag 
acagaaaaca aatctgagtt cctgttaacg 
atggataaag agagacgaaa gtctttacta 



gacccactga cagattgtcc gtataataaa 60 
aatggagaga atttctgcaa acaggtcaca 120 
attagctatg ccaaaggact tcagaaactg 180 
acgagaaaaa gttgtgttag cagtgcctgg 240 
gcggacctgc atcaaaaact tggcaaagca 300 
caagtcctaa atgtacaaga gaagaagaga 360 
gcaaatcttg tcattagcaa ctggaatcag 420 
agtaccaaga aacatgaagc acttttccag 480 
gagaaggaga agcggaagct cctcaataaa 540 
gaagatgaaa attactacca aaaaaacatg 600 
aacacactag agaactgcta ccagagcatt 660 
ttatgcaata acttaaacca gtacagccaa 720 
acatgccaca cgcagattca ctgtgccatc 780 
gctgtaatgg aagaaactgc aattttatct 840 
gattactttg aagaagatcc taacagtgca 900 
aaaccaaaat tattgagact gcagagagac 960 
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attgaaaaag 
acctcctcct 
aatttgaaac 
cttgagcaaa 
aaggagcata 
aatattgtga 
cctggtgcag 
caagatgatg 
gaaggatggt 
gaggagttac 
atactacctt 
cttaccttta 



cctcaaaaga 
tctctgatgc 
tagacctttt 
gacctcaacc 
ctcatagcta 
gcaaggcatc 
cccagctcag 
agttgaattt 
ggtttggatc 
cttcaaatgc 
cacactcggt 
a'aaaaaaaa 



caaggaaggc 
aaagagccag 
ggaagcgaac 
cagccatcct 
tgtgaaaata 
ttctggtggg 
cagcagactt 
ggaaaagggt 
tttgaatggg 
tggcaacaca 
aatcaacaat 



ctggaacgaa 
aaagacacag 
tcctacaaac 
tgtagtaa^t 
tctcggcctt 
cagagcaatc 
tgcaaggcct 
gacattgtga 
aaaaaaggcc 
gctacaaagg 
acagtgtggt 



tgcttaaaac 
1 cagcgttaat 
tgtcatcaat 
ccatcttcag 
ttttaatgaa 
caggttcttc 
tgtattcttt 
ttatacacga 
attttcctgc 
cataaaacaa 
tcaaataaga 



gtactccagc 
ggatgagaac 
gttagcagaa 
gtggagggaa 
gagattagag 
aactccagcc 
tcaagccagg 
gaaaaaagaa 
cgcttatgtg 
gactctgaac 
ataaagtgct 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
\1620 
1639 



<210> 19 

<211> 1504 . i 

<212> DNA ( j' 

<213> Homo sapiens < 

<220> t 
<221> misc-feature 

<223> Incyte Clone 2472655 

i 

<400> 19 

cgaaatcgta ggacttccga aagcagcggt 
tgcgcgaaga tgcgaaaggt ggttttgatc 
ctctgcaagc ggctgctggc ggaagatgat 
atgagcaagg cagaagctgt ctgtgctgct 
accattgtcc aggtggatgt cagcaacctg 
aagcaaaggt ttcagagatt agactgtata 
caactaaata tcaaagcact tttctttggc 
tccacagctg aaggcctgct gacccagggt 
gtgtttgaga ccaatgtctt tggccatttt 
tgtcacagtg acaatccatc tcagctcatc 
aatttcagcc tcgaggactt ccagcacagc 
tatgccactg accttttgag tgtqgctttg 
tccaatgtgg cctgtccagg tacagcattg 
tttatatgga cgctgttgat gccggcaata 
actttgacac catataatgg aacagaagct 
tctctcaatc ctctgatcaa atatctgagt 
atgacccaga agatggacct agatgaagac 
gaactggaaa agcacattag ggtcactatt 
ggctcatgcc tataattcca gcactttggg 
ggagttcaag accagcctga gaaacatagt 
ataatagctg ggtgtggtgg catgcgcatg 
gaggatctct tgaggctggg aggcagaggt 
ccagcctggg tgacagcgag accctgtctc 
aaccagagct gacaatgaca ctctggaaca 
tggatttcta ctgagttgga taatatgcat 
aaaa 



i 



ggcgtttgct tcactgcttg gaagtgtgag 60 
accggggcta gcagtggcat tggcctggcc 120 ? 
gagcttcatc tgtgtttggc gtgcaggaac 180 
ctgctggcct: ctcaccccac tgctgaggtc 240 
cagtcggtct tccgggcctc caaggaactt 300 
tatctaaatg ctgggatcat gcctaatcca 360 
ctcttttcaa gaaaagtgat tcatatgttc 420 
gataagatca ctgctgatgg acttcaggag 480 
atcctgattc gggaactgga gcctctcctc 540 
tggacatcat ctcgcagtgc aaggaaatct 600 
aaaggcaagg aaccctacag ctcttccaaa 660 
aacaggaact tcaaccagca gggtctctat 720 
accaattt^a catatggaat tctgcctccg 780 
ttgctacttc gcttttttgc aaatgcattc 840 
ctggtatggc ttttccacca aaagcctgaa 900 ' 
gccaccactg gctttggaag aaattacatt 960 
actgctgaaa aattttatca aaagttactg 1020 
caaaaaacag ataatcaggc caggctcagt 1080 
aggccaaggc agaaggatca cttgagacca 114 0 
gagcccttgt ctctacaaaa agaaataaaa 1200 
tagtcccagc tactcagaag gatgaggtgg 1260 
tgcagtgagc tgagattgtg ccactgcact 1320 
aaaatatgta tatatttaat atatatataa 1380 
ttgcatacct tctgtacatt ctggggtaca 1440 
ttgtaataaa ctatgaacta tgaaaaaaaa 1500 

1504 



<210> 20 

<211> 3096 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 2948818 

<400> 20 

gggtgttctt ataacttaag ttcagttttt tctttcctgt gaggaagagg cagtttttta 60 
aatgaagcca tctctgggga aatcgtattg attgttgtag ctaaatacgg aatttttaaa 120 
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gtctttagta tgttgaactg gaaatatagg acatgcgttt gagatctact gtgagttgca 180 
tcataaatac aaaggactga agttataaaa ,gagaaaagag aagtttgctg ctaaaatgaa 240 
tctgagcaat atggaatatt ttgtgccaca cacaaaaagg tactgaggat ttacccccca 300 
aaaaaaattg tcaatgagaa ataaagctaa ctgatatcaa aaagcagagc ctgctctact 360 
ggccatcatg cgtaaagggg tgctgaagga cccagagatt gccgatctat cctacaaaga 420 
tgatcctgag gaacttttta ttggtttgca tgaaattggg catggaagtt ttggagcagt 480 
ttattttgct acaaatgctc acaccagtga ggtggtggca attaagaaga tgtcctatag 540 
tgggaagcag acccatgaga aatggcaaga tattcttaag gaagttaaat ttttacgaca 600 
attgaagcat cctaatacta ttgagtacaa aggctgttac ttgaaagaac acactgcttg 660 
gttggtgatg gaatattgct taggctcagc ctctgattta ttagaagttc ataaaaaacc 720 
acttcaggaa gtggagatcg ctgccattac tcatggagcc ttgcatggac tagcctacct 780 
acattctcat gcattgattc atagggatat taaagcagga aatattcttc taacagagcc 840 
aggtcaggta aaactagctg attttggatc tgcttcaatg gcttctcctg ccaactcctt 900 
cgtgggcaca ccttactgga tggctccaga ggtgatctta gctatggatg aaggacagta 960 
tgatgggaaa gttgatattt ggtcacttgg catcacttgt attgaattgg cggaacggaa 1020 
gccgcccctt ttcaacatga atgcaatgag tgccttatat cacattgccc agaatgactc 1080 
cccaacgtta cagtctaatg aatggacaga ctccttta^g agatttgttg attactgctt 1140 
gcagaaaata cctcaggaaa ggccaacatc agcagaacta ttaaggcatg actttgttcg 1200 
acgagaccgg ccactacgtg tcctcattga cctcatacag aggacaaaag atgcagttcg' 1260 
tgagctagat aacctacagt accgaaaaat gaaaaaaata cttttccaag agacacggaa 1320 
tggacccttg aatgagtcac aggaggatga ggaagacagt gaacatggaa ccagcctgaa 1380 
cagggaaatg gacagcctgg gcagcaacca ttccattcca agcatgtccg tgagcacagg 1440 
cagccagagc agcagtgtga acagcatgca ggaagtcatg gacgagagca gttccgaact 1500 
tgtcatgatg cacgatgacg aaagcacaat caattccagc tcctccgtcg tcycataagaa 1560 
agatcatgta ttcataaggg atgaggcggg ccacggcgat cccaggcctg agccgcggcc 1620 
tacccagtca gttcagagcc aggccctcca ctaccggaac agagagcgct ttgccacgat 1680 
caaatcagca tctttggtta cacgacagat ccatgagcat gagcaggaga acgagttgcg 1740 
ggaacagatg tcaggttata agcggatgcg gcgccagcac cagaagcagc tgatcgccct 1800 
ggagaacaag ctgaaggctg agatg^acga gcaccgcctc aagctacaga aggaggtgga 1860 
gacgcatgcc aacaactcgt ccatcgagct ggagaagctg gccaagaagc aagtggctat 1920 
catagaaaag gaggcaaagg tagctgcagc agatgagaag aagttccagc aacagatctt 1980 
ggcccagcag aagaaagatt tgacaacttt cttagaaagt cagaagaagc agtataagat 204.0 
ttgtaaggaa aaaataaaag aggaaatgaa tgaggaccat agcacaccca agaaagagaa 2100 
gcaagagcgg atctccaaac ataaagagaa cttgcagcac acacaggctg aagaggaagc 2160 
ccaccttctc actcaacaga gactgtacta cgacaaaaat tgtcgtttct tcaagcggaa 2220 
aataatgatc aagcggcacg aggtggagca gcagaacatt cgggaggaac taaataaaaa 2280 
gaggacccag aaggagatgg agcatgccat gctaatccgg cacgacgagt ccacccgaga 2340 
gctagagtac aggcagctgc acacgttaca gaagctacgc atggatctga tccgtttaca 2400 
gcaccagacg gaactggaaa accagctgga gtacaataag aggcgagaaa gagaactgca 2460 
cagaaagcat gtcatggaac ttcggcaaca gccaaaaaac ttaaaggcca tggaaatgca 2520 
aattaaaaaa cagtttcagg acacttgcaa agtacagacc aaacagtata aagcactcaa 2580 
gaatcaccag ttggaagtta ctccaaagaa tgagcacaaa acaatcttaa agacactgaa 2640 
agatgagcag acaagaaaac ttgccatttt ggcagagcag tatgaacaga gtataaatga 2700 
aatgatggcc tctcaagcgt tacggctaga tgaggctcaa gaagcagaat gccaggcctt 2760 
gaggctacag ctccagcagg aaatggagct gctcaacgcc taccagagca aaatcaagat 2820 
gcaaacagag gcacaacatg aacgtgagct ccagaagcta gagcagagag tgtctctgcg 2880 
cagagcacac cttgagcaga agattgaaga ggagctggct gcccttcaga aggaacgcag 2940 
cgagagaata aagaacctat tggaaaggca agagcgagag attgaaactt ttgacatgga 3000 
gagcctcaga atgggatttg ggaatttggt tacattagat tttcctaagg aggactacag 3060 
atgagattaa attttttgcc atttacaaaa aaaaaa 3096 



<210> 21 

<211> 1527 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 054191 

<400> 21 

ctccgcttga ggagaagcgc caagtgcgca tggggacgct atagcaattc gtttgctgtc 60 
cttcctctcc ttcgaagatg acaaggccta ccatcgtttc ttcctgcctt tgggccgtca 120 
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ggcagttggt tgggacccgc tccaaccctc ( 
tgtcatggct actctgagtg ttataggttc 
taatatacag aaatctccag agataagacc 
cctgggactt tgctggctca cggagacact 
catcatctgc tataacctac aagcagttgg 
caccgtcaat tacatctggt atttgtacac 
acagagcaca tctccactgg tgatagatta 
tttctcaagc ctgatacctc tgctattgat 
tgaatgtttc caaaacttca gtcagagcca 
agccatggct gaacttccac cttctgccaa 
tggtatcgcc attttcctgg gcagctttgt 
tatccgagcc cagacattgt ataagaagtt 
acagtgggca gtgattcaca ttgtggacca 
ttgctgctgg ggcccagctg tcattctaat 
caagcttcac atggcccttt atgttctcca 
caactgtgga gtatatggct ggacgcaqfca 
gcgtgatgca gatacccaga caccattatt 
cttaaattca ctggaatcca ccctgacttt 
ctacaatact ggaacatcca ggaactggag 
ttgggaaagg acatctrtaaa tcttttctaa 
atatagtgcc attgttagta gtcattctag 
cagattcact agtgatcctt aaagtctcta 
gagatagact cgtgttacct tgatggatat 
taaattctag attattaaaa aaaaaaa 



ggttcttcct gcaatacagt ggatacaatt 180 
aagttcactt attgcctatg ctgtattcca 240 
acttttttat ctgagcttct gtgacctgct 300 
tctctatgga gcttcagtag caaataagga 360 
acagatattc tacattjtcct catttctcta 420 
agagctgagg atgaaacaca cccagagtgg 480 
tacttgtcga gttggtcaaa tggcctttgt 540 
gacacctgta ttctgtctgg gaaatactag 600 
caagtgtatc ttgatgcact caccaccatc 660 
cacatctgtc tgtagcacac tttattttta 720 
actcagcctc cttaccatta tggtcttact 780 
tgtgaagtca actggctttc tggggagtga 840 
acgggtgcgc, ttctacccag tggccttctt 900 
gatcataaag ctgactaagc cacaggacac 960 
ggctctaacg gcaacatctc agggtctact 1020 
caaattcqac caactaaagc aggaggctcg 1080 
atgctcacag aagagattct atagcagggg 1140 
tcctgccagt acttctacca ttttttgaaa 1200 
ttattctacg ctaatggatt ggaaagaatg 1260 
ctatgcccta aactgcagaa ctcaaaggaa 1320 
atgaattggg agtatctctc cagttattcc 1380 
ttcagggaga ggaagacact ttccatctca 14 40 
tggatttgtc taagtctctt ctagaaaaaa 1500 

1527 



<210> 22 

<211> 2948 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 1403604 



<400> 22 

aaagaaaggt cagccgcaag cgaabttagc 
cgagatgcgc tcttcccgga agtgacgcac 
atgggtggtt tttttccccc taccgaggtc 
cacaaccgtg acagactgcc attcctgagt 
agcgtcgggc aggcgcggta cagaacacca 
tggccgcgcg cgggtacgtg caggacccct 
gccgcgcacc gctcattcac cgaggctact 
tgcgcgcttt tttggagcag attggcgcgc 
ctctcggcgc tggcttcgac tcgctctatt 
gggctgcagt ctgggaggtg gattttccgg 
gagagacgcc agagctgtgc gcgttaaccg 
cgctgtgctt tgagagcgca gactactgca 
gagtggagga ggccctgggc gccgcggggc 
ccgaggcggt gctgacctac ctcgagccgg 
cccagcgttt tcctaatgcc cttttcgtgg 
ttggccagtt catgctgcaa cattttcggc 
gttttcctga cgtggaggcg cagcggcgcc 
gtgccgtgga cataaatgaa ttctatcact 
tggaaaatat tgaacccttt gacgaatttg 
tcattctggc agcttctagg ggagacaccc 
aggcatttcc tcgcgtaaat cctgcttcgc 
gtagcgaggg ccaggtccca aacctgaaga 
cagacgttat tctcagtgca ggaggatttg 
gccagtttca cttgctctca agagattgtg 
gttgtgggac tggagttcag tgggatggac 
agagtcgggt tctggttctg ggagggagac 
tccagcttca tttttttaag agtgaggata 
caaaggctgg ccgaaaggat gattccactt 



actggctada ccctcctcaa ttctggttgg 60 
aagtgccggc ggaaggggaa gtccaggagc 120 
cgtgaggtgt gtgctaacca aggggcggct 180 
ctcttctggc catgggcccc cggagccgtg 24 0 
acgacagcag cgccctcagc aagcgttccc 300 
ttgccgcgtt gctggttccg ggcgcggcgc 360 
acgtccgcgc acgcgccgtg aggcactgcg 420 
cccaggccgc gcttcgcgcg cagatcttgt 480 
ttcgcttaaa aaccgcgggc cgcctggccc 54 0 
acgtggcgcg gcgcaaagca gaaaggattg 600 
ggcctttcga gaggggggag cccgcgtccg 660 
tcctgggtct ggacttgcgg cagctccagc 720 
tcgacgcagc ctcacccact ctgctcctgg 780 
agagtgccgc ggccctcatc gcctgggcag 840 
tctatgagca gatgaggcct caagacgcct 900 
agctaaactc ccccctgcat ggcctggagc 960 
gcttccttca agctggctgg accgcctgcg 1020 
gctttcttcc cgcagaagaa cgccggcggg 1080 
aggagtggca tctgaagtgc gcccattatt 1140 
tctcccacac cctagtgttt ccatcctcag 1200 
cttcaggggt attccctgcc agcgtagtca 1260 
gatatggcca cgcctctgtc ttcttgagcc 1320 
gagagcagga ggggcggcac tgccgagtga 1380 
actctgaatg gaaaggcagc caaataggca 14 40 
gcctttatca caccatgaca agactctcag 1500 
tgtccccagt aagtccagcc ttgggggttc 1560 
ataacactga ggacctgaaa gtgacaataa 1620 
tgtgttgttg gcggcattca acaacagaag 1680 
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tgtcctgtca 
tactaagtga 
agggagaagt 
ttattgctgg 
caatctcttg 
actcccacac 
attcctcctc 
atcagattga 
ttcttcctga 
gtacctactt 
aaggactgga 
atgaccctct 
actattcagt 
actacagaaa 
taattttcta 
ttcccaatcc 
gtggttgaat 
tatagttgag 
tcacagtttg 
aacaggaaac 
ttccatattt 
aaaaaaaa 



gaatcaggaa 
ctggcatttc 
acctgaagcc 
aggtctcggg 
tggattcctc 
agctcatgtg 
atttcctgga 
cacaacatat 
agagcaacag 
caacccccat 
ctaatattca 
agctatagat 
gcaaaaagtg 
acagctgaag 
atctaagttc 
caggccttgt 
atatttgaga 
ggagcgtagt 
attaaaacat 
acttgtagga 
attttagattt 



tatttgtttg 
ctccatgtag 
cggcattctc 
gcttctgagg 
tgggagtcag 
ctcaatggaa 
gtgactgtga 
gtgccatggc 
ctcctgctcc 
, acagtcacat 
ggacccacta 
actgccactc 
aaaaaggttg 
acagtggcca 
gtttattgag 
gatttgagat 
tggtacctta 
lattgacaatt 
tgtcctgttt 
cttcctttac 
ataagtgtca 



tgtatggggg 
ggacaatggc 
acagtgcctg 
agccattgaa 
tagacatcca 
agctgttact 
tcaatttgac 
cattaatgtt 
ttggaggtgg 
tagacctttc 
aagtagacaa 
ctcctttccc 
gtaaaatagcj 
tgcagtccga 
ttacagtg^t 
ggtapcttag 
gaaaaagttt 
cttcatgtag 
gttg'caacag 
caacatactt 
ataaagcaaa 



tcgaagcgtg 
ttgggtcagg 
cacttggcaa 
ctctgtgctc 
gcctcccatt 
ggttggaggg 
tacaggattg 
acacaaccat 
tgggaactgc 
ttccttaagt 
taaagttttc 
catccttttt 
taaaatacct 
gaggagtagt 
ctttagtaaa 
aaaaagttac 
cacgcagatc 
gaaacctgaa 
aaaactcgga 
tttaaatgtt 
cttttgatgc 



gtggaacctg 
atcccagtgg 
gggggagccc 
tttctgagac 
accccaaggt 
atctggattc 
agctctgagt 
actagtatcc 
ttttcctttg 
gctgggcagt 
cacaaatagg 
ttcccttagc* 
agaaacaatc 
ggtctgcctc 
gtaaaacaat 
acgcagttcc 
cttggttgaa 
atgaacacag 
tagttttaac 
ttgctattgg 
ctcaaaaaaa 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
294Q 
2948 



<210> 23 

<211> 1808 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> mis c- feature 

<223> Incyte Clone 1652936 



<400> 23 

gagagtatta 

atacgatttg 

acagaaaaaa 

acgatgtcac 

agccatggct 

gctaggcagc 

cttcggcatc 

agcagttacc 

cagttccgat 

ttcttcacca 

gctgactgtg 

gctcggaaga 

ctgcaggcag 

tcactgccgg 

atggctgacc 

agtactatca 

tccaacagtc 

gccatgaaaa 

agagcctcct 

ggattgttac 

gtgtcggagg 

agtccagcaa 

ttcagaccag 

gcttctgcgg 

aaggccattc 

aacagtgagc 

caattggaac 

gggtgacctt 

gatgtttgac 

tgtatagact 



aatgtgaatt 
ggaacccagc 
attccttcaa 
ccctcgcctc 
acagcaaact 
cccttgtcca 
aggcttccct 
ccgacgaacc 
cccttcagcc 
aatcaaagct 
atggagccct 
acggctaccc 
cttttcctaa 
caaatcaaca 
ttcctgtccc 
atgaagcctt 
tcaaagcaag 
ggggcgagga 
ccttggatct 
ccccaccacc 
ccgagttact 
agaaggatgt 
aaaaccaagc 
caaccatgaa 
aaactgctat 
tccagcagca 
aacttcgtcc 
gtctgccaag 
tgtgtcaaaa 
taaaaaaaaa 



gcctctgcct 
tgagctgcat 
aagaatggac 
ccctccttct 
gcggagcagc 
gcccgaggga 
tatccggtcc 
ctggaggata 
agacccaagc 
ttccattcca 
gaccctgcct 
attgcctgag 
gcccaaatgg 
acctcgtgac 
taaccaggat 
accaaaggac 
accaagatcc 
ccctcccacc 
gaataaagtc 
tgcgctccct 
cccacagctg 
actgtattct 
tacagaaaac 
accgcatcca 
ccgcaaaaat 
gctcaaggag 
ggtcactgtg 
atctttcttt 
gctgtgagca 
aaaatagatc 



cgctttatga 
ggaactaagg 
gatgaggata 
tccccgcdtc 
gcagaacaga 
tcctcatcag 
ttttcagtgg 
acagaagaac 
tctttcattt 
gaactctcct 
gagttctgtg 
ggcctccctc 
gactgtcaat 
ttgaatcgga 
gtaactagtg 
gtgtctgagg 
agatcttact 
ccgccacctc 
ttccagccca 
ccaagacctt 
agcagagccc 
cagccaccat 
caagagcctt 
acagtccaaa 
aaagaggcaa 
gttcatcaag 
ttgtgacccc 
tgaatgtttt 
gcaaaatata 
tttaattaag 



tgtcaaagaa 
ttcagattcc 
aacaggaaac 
attaccagag 
tgcatccagc 
ggggcccagg 
agagggaact 
agcgcgagta 
caggttctgt 
atatatggga 
ctgcgtttca 
caactctgca 
tatttgattc 
tggagacatc 
atgacaaaca 
atccagcaac 
ctagcacctc 
ggccacagaa 
gtgtgccagc 
gtccatcaca 
catcccaggc 
caaagcccat 
ccactgctgc 
agcagtcttc 
acgcagtgct 
aacgaattgc 
cccatggttc 
gaacccaact 
atccatatga 
cggtcgcaag 



tgatggtgag 
atatttaact 
acagtctccc 
ggtgcccttg 
accttatgaa 
aaccaagccc 
acaggataac 
ctatgtcaat 
ggccaagaac 
gcttagtgat 
tctcattgtg 
gccagaatac 
ttattctgag 
tgttaaagac 
agctttgaaa 
tcccaaggat 
catagaagag 
aacccattcc 
taccaagtca 
gtctgaacaa 
tgcagaaagt 
tcgtaggaaa 
aagtgggcca 
caaacagaag 
ggctcggctg 
attggaaaac 
aagtgacagt 
acttgtcata 
ccttttctct 
cttattccct 



60 
120 
180 
240 
300. 
36d 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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ttagtgag , 1808 



<210> 24 

<211> 1148 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 1710702 

<400> 24 

tgcgtacgtg cgtcgtctct atggtggcgg. 
tcaggcgagc cgatctgggg ctgcaggatg 
ggctcggccg ccgccctgtg caacaacctc 
acgtattttg gcgtggttca tggaccaagc 
gtgcccttgg cccagcgcca gctccacgct 
atcactcagg tccactggtg tgtcctcccc 
cgaggaatac agatgtacga gtccaatggc 
tctggagatg cctccccagt acaggctgtg 
ttcatctgtg tgggaacgtg gtcaggccgg 
cccaacattg tactgagcga ggagctggct 
accgagcctg cccagggaca ggattgtgtg 
ttgctgtgtg tctggcggtc agggccagaa 
ggagttccgt gcccctctgt gcagctgtgg 
ggacaagtgc atctatatga ggccactaca 
gcccgggcca tctgcgccct ggacctggct 
gaggacacct ttgtgcatat ctgg^agctg 
gtggaacact gtcatggtga gtgtgtcgcc 
gattcctcag gcaactcctt tgctgtgact 
agcagtgtgt gagaagagca gccttccttt 
cacccaaa 



cggatttgga gggaccctac gaaccaggag 60 
ttccgctggg aijcgctccat tcccctgcga 120 
agtgtgctgc agctgccggc tcgcaacctc 180 
gcccagcttc tcagcgctgc tcctgagggt 240 
aaggagggtg ctggagtgag tcccccactt 300 
ttccgagtgc tgctggtact cacctcacat 360 
tacaccatgg tctactggca tgcactggac 420 
tttgcccggg gaattgctgc cagtggccac 480 
gtgctggtgt ttgacatccc agcaaagggt 540 
gggcaccaga tgccaatcac agacattgcc 600 
gctgacatgg tgacggcaga tigactcaggc 660 
ttcacattat tgacccgcat tccaggattt 720 
caggggatca tagcagcagg ctatgggaac 780 
ggaaatctac atgtccagat caatgcccat 840 
tctgaggtgg gcaagctact ctctgcaggt 900 
agcagaaacc cagagagtgg ctacattgag *960 
gacacccagc tgtgtggtgc tcgattttgt 1020 
ggctatgacc ttgcggagat ccggagattc 1080 
gtccctgtgg tattcataaa gtacccgctc 1140 

1148 



<210> 25 

<211> 2419 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc-feature 

<223> Incyte Clone 323914 9 



<400> 25 

cggacgcgtg ggggcctggc accaaagggg 
tcggcgatta tggacccggc cgaggcggtg 
tcctcagaga gagaagactg taataatggc 
aattcactta gacagacata caacagctgt 
tgtttagcaa gcactgctat gaagactgag 
ggcacttcca gtatgattgt gcccaagcaa 
aaggaactgt gtgtcaaata ctttgagcag 
gaacatctta tatcccaaat gtgtcattac 
cctatgttgc agagagattt cataactgct 
gagaacattc tgtcatacct ggatgccaaa 
gaatggtacc gagtgacctc tgatggcatg 
aggacagatt ctctgtggag aggcctggca 
aaaaacaaac ctcctgacgg gaatgctcct 
aaaattatac aagacattga gacaatagaa 
cagagaattc actgccgaag tgaaacaagc 
cagaaaatag taagcggcct tcgagacaac 
gaatgcaagc gaattctcac aggccataca 
agagtgatca taacaggatc atcggattcc 
gaaatgctaa acacgttgat tcaccattgt 



cggccccggc ggagagcgga cccagtggcc 60 
ctgcaagaga aggcactcaa gtttatgaat 120 
gaacccccta ggaagataat accagagaag 180 
gccagactct gcttaaacca agaaacagta 240 
aattgtgtgg ccaaaacaaa acttgccaat 300 
cggaaactct cagcaagcta tgaaaaggaa 360 
tggtcagagt cagatcaagt ggaatttgtg 420 
caacatgggc acataaactc gtatcttaaa 480 
ctgccagctc ggggattgga tcatattgct 540 
tcactatgtg ctgctgaact tgtgtgcaag 600 
ctgtggaaga agcttatcga gagaatggtc 660 
gaacgaagag gatggggaca gtatttattc 720 
cccaactctt tttatagagc actttatcct 780 
tctaattgga gatgtggaag acatagttta 840 
aaaggagttt actgtttaca gtatgatgat 900 
acaatcaaga tctgggataa aaacacattg 960 
ggttcagtcc tctgtctcca gtatgatgag 1020 
acggtcagag tgtgggatgt aaatacaggt 1080 
gaagcagttc tgcacttgcg tttcaataat 1140 
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ggcatgatgg tgacctgctc caaagatcgt 
actgacatta ccctccggag ggtgctggtc 
tttgatgaca agtacattgt ttctgcatct 
agtacttgtg aatttgtaag gaccttaaat 
tacagggaca ggctggtagt gagtggctca 
gaatgtggtg catgtttacg agtgttagaa 
tttgataaca agaggatagt cagtggggcc 
gtggctgctt tggacccccg tgctcctgca 
cattccggaa gagtttttcg actacagttt 
gatgacacaa tcctcatctg ggacttccta 
cgttcccctt ctcgaacata cacctacatc 
acttgcccag gacccattaa agttgcggta 
aacaacagta acaatcaaac tactgcccag 
tgagactcct gttgggacac agttggtctg 
actgactgct tcagtgctgc tatcagaaga 
ttttaaacct cccctcctct cctcctttca 
cagacaaagg tgacttataa 1 atatatttag 
ggccattaag gcaggaggaa ctaggtttcc 
ccactctagg gggtaggggg gggggtgnca' 
tccccggtta actgcagggg atgtccagga 
ataggtctag gggaggggga cagagttccc 
aagggaaaca caggatttg 



<210> 26 

<211> 746 ; 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 3315936 

<400> 26 

atttaatatg actcactata gggaatttgg 
ttgcttagac tgcggcccac gtggaaggct 
aacaagcaaa tcctagtgct gggcctggat 
ctagcttcaa acagagtcca gcacagtgtg 
atcaacactg aagacagcca gatggagttc 
tcctactggg aaatgtacct atccaaggga 
gatcacagcc gattacctga agccaagaaa 
gtacttcctc tggttgtgtt tgcaaacaaa 
gatatccatg aagctttggc attatctgaa 
ggaacctacc tgactaagaa tggctcagag 
ttgattgcac agctggctgc agatgtgcag 
cgactgagat gtcatcagtg ttgaatggca 
aaaaattaag ccatttccta ttaaaa 



tccattgctg tatgggatat ggcctcccca 1200 
ggacaccgag, ctgctgtcaa tgttgtagac 1260 
ggggatagaa ctataaaggt atggaacaca ,1320 
ggacacaaac gaggcattgc ctgtttgcag 1380 
tctgacaaca ctatcagatt atgggacata 1440 
ggccatgagg aattggtgcg ttgtattcga 1500 
tatgatggaa aaattaaagt gtgggatctt 1560 
gggacactct gtctacggac ccttgtggag 1620 
gatgaattcc agattgtcag tagttcacat 1680 
aatgatccag ctgcccaagc tgaacccccc 1740 
tccacfataaa taaccataca ctgacctcat 1800 
tttaacgtat ctgccaatac caggatgagc *1860 
tttccctgga, ctagccgagg agcagggctt 1920 
cagtcggccc aggacggtct actcagcaca 1980 
tgtcttctat cttttgtgaa tgattggaac 2040 
cctctgcacc tagttttttc ccattggttc 2100 
gtgttttngc ccaggaatct ctcttgcttt 2160 
cctgtatagg gcctgcgggg ggagaggacc 2220 
gcttticaagg cccaggggcc ccaggtgtct 2280 
ccgggggggc tacgagcaag gcccggcccc 2340 
ctcgtaatag ggctcggggg agggcaggga 2400, 

2419 



i 



ccctcgagct agagattcgg gcacgagggg 60 
cttagccacc ctgcctggcc cgaggagaaa 120 
ggagcaggaa aaaccagtgt cctgcactct 180 
gcacccaccc aaggtttcca tgcagtttgc 240 
ctggagattg gtggcagtaa accttttcgg 300 
ttgctgctga tctttgtggt ggattcagca 360 
taccttcatc agctaattgc agcaaaccca 420 
caggatcttg aagcagccta tcacattaca 4 80 
gtgggaaatg acaggaagat gttcttgttt 540 
ataccctcca ccatgcaaga tgccaaagac 600 
tgaccaggac tcagcccact gtgcggctca 660 
ggcttgaagc caaaggtttc cacctcaaat 720 

746 
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